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20 

21 BACKGROUND INFORMATION 

22 The Maximus and Internet Growth 

23 As the Internet grows, the volume of data traversing the various networks that 

24 comprise the Internet has increased tremendously. While packet-based technology is 

25 widely believed to be the dominant technology going forward, in the short term, no single 

26 technology has emerged as the clear winner in the race to satisfy the seemingly insatiable 

27 demand for bandwidth, which is fueled by the spectacular growth of the Internet. 

28 At the same time, carriers respond to customers' demand for new applications by 

29 rolling out increasingly sophisticated data services, some of which are highly delay- 

30 sensitive and/or bandwidth-intensive (e.g., voice or video). The introduction of these 

3 1 differentiated services opens up opportunities for competing system vendors to introduce 

32 new technologies and/or protocols. As a consequence, changes in the networks are 

33 characterized not only by vastly increased bandwidth requirements but also by a highly 

34 diverse array of technologies and protocols. 

35 The aforementioned changes impact the access, core/long-haul, and 

36 metro/regional networks in different ways. In the foreseeable future, the demand for 

37 high speed access (i.e., in the megabit to low gigabit range) appears to be satisfied by a 

38 multitude of competing technologies, e.g., xDSL, cable modem. Frame Relay, Tl, T3, 

39 ATM, and Gigabit Ethernet. Some of the newer technologies such as xDSL and cable 

40 modem have made impressive gains, enabling users to transmit and receive data at vastly 

41 increased data rates, thereby facilitating higher-bandwidth applications, some which were 

42 impractical only a few years ago when dial-up or lower speed ISDN connections were the 

43 dominant access technologies. 

44 Corresponding changes also occur in the core/long-haul networks. In the core 

45 network, the emergence of an all-optical core and DWDM technologies have made it 

46 practical to transfer terabits of data across long-haul trunks at substantially lower costs. 
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1 However, all-optical technologies have not progressed to the point where it is possible to 

2 process packets and/or cells in the optical domain. Processing intelligence is still very 

3 much the province of electronics. Accordingly, an all-optical core, when operated in its 

4 pure optical mode, is substantially devoid of processing intelligence, such as the type 

5 required to correctly perform end-to-end routing of a packet or cell from one user to 

6 another. Without resorting to the electrical plane, it is currently not practical to offer 

7 differentiated data services (e.g. , routing different streams of traffic in different ways in 

8 accordance to their classes of service) via all-optical technologies. 

9 The metro/regional network bears the full brunt of the powerful changing forces 

10 that affect the access and core networks. As an aggregation mechanism for access 

1 1 networks, the metro/regional network must move to higher speeds to cope with the 

12 increased bandwidth requirements. The metro/regional network must also interoperate 

13 with legacy and emerging transmission technologies and protocols, as well as adapt the 

14 divergent technologies and protocols to its own transmission requirements and the 

1 5 requirements of the core network. The metro/regional network must also perform 

16 extensive cell and packet processing, as these tasks are no longer performed by the 

17 optical core network. To facilitate differentiated service offerings, the metro/regional 

18 network must also handle traffic management for different data streams having different 

19 classes of service. In short, the metro/regional network of tomorrow is characterized by 

20 higher speeds, diverse transmission technologies and protocols, and enhanced processmg 

21 requirements. 
22 

23 Capabilities of the Maximus 

24 The Maximus is designed to capitalize on these trends. It is a high integration 

25 ASIC designed for high speed switching and routing systems. The Maximus also finds 

26 applications in access devices operating at OC-192, or as an aggregation device in the 

27 core network. In a typical line card deployment, the Maximus is implemented between 

28 an OC-192 or a quad OC-48 multi-service framer and the switch fabric. 
29 

30 BRIEF DESCRIPTION OF THE DRAWINGS 

3 1 The present invention is illustrated by way of example, and not by way of 

32 limitation, in the Figures of the accompanying drawings and in which like reference 

33 numerals refer to similar elements and in which: 
34 

35 Figure 1 is a figure showing multi-service system. 

36 Figure 2 is a figure showing OC-192 line card for traffic manager application, 

37 Figure 3 is a figure showing OC-192 line card for traffic manager application, 

38 Figure 4 is a figure showing various applications of the maximus chipset along with the 

39 type field, 

40 Figure 5 is a figure showing traffic manager applications. 

41 Figure 5A is a figure showing uses of the Maximus chip with cell-based switch fabric. 

42 Figure 6 is a figure showing SAR applications, 

43 Figure 6A is a figure showing uses of the Maximus chip with p acket-based switch fabnc. 

44 Figure 7 is a figure showing ingress tm (ATM=>ATM), 

45 Figure 8 is a figure showing ingress tm (ATM=> MPLS packet), 

46 Figure 9 is a figure showing ingress tm (MPLS packet=>ATM), 
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1 Figure 10 is a figure showing ingress tm (packet=>packet)^ 

2 Figure 1 1 is a figure showing ATM encapsulation^ 

3 Figure 12 is a figure showing reassembly. 

4 Figure 13 is a figure showing ingress packet bypass^ 

5 Figure 14 is a figure showing egress tm (ATM=>ATM)^ 

6 Figure 15 is a figure showing egress tm (ATM=> MPLS packet)^ 

7 Figure 16 is a figure showing egress tm (MPLS packet =>ATM)^ 

8 Figure 17 is a figure showing egress tm (packet=>packet)^ 

9 Figure 18 is a figure showing ATM de-encapsulatiom 

10 Figure 19 is a figure showing segmentation. 

1 1 Figure 20 is a figure showing egress packet bypass^ 

12 Figure 21 is a figure showing Maximus chipset top level block diagram, 

13 Figure 22 is a figure showing signals, 

14 Figure 23 is a figure showing block addresses, 

15 Figure 24 is a figure showing read access and timings definitions, 

16 Figure 25 is a figure showing read access characteristic parameters, 

17 Figure 26 is a figure showing write access and timings definitions, 

18 Figure 27 is a figure showing write access characterisitcs parameters, 

19 Figure 28 is a figure showing interface from the CPU if to the blocks^ 

20 Figure 29 is a figure showing global sync, 

21 Figure 30 is a figure showing slot_count and CPU_port signals, 

22 Figure 31 is a figure showing interface of the CPU if block, 

23 Figure 32 is a figure showing internal implementation for locking the current CPU 

24 access, parameters, 

25 Figure 33 is a figure showing test mux block diagram, 

26 Figure 34 is a figure showing soft resets, 

27 Figure 35 is a figure showing input/output disable, 

28 Figure 36 is a figure showing registers, 

29 Figure 37 is a figure showing incoming spi-4 interface, 

30 Figure 38 is a figure showing input control block in spii. 

3 1 Figure 39 is a figure showing output control block in spii, 

32 Figure 40 is a figure showing bit align block, 

33 Figure 41 is a figure showing drop idle state machine, 

34 Figure 42 is a figure showing control word format, 

35 Figure 43 is a figure showing descriptions of fields in the control words, 

36 Figure 44 is a figure showing control word type list, 

37 Figure 45 is a figure showing rx state machine transition table, 

38 Figure 46 is a figure showing extent of dip-4 codewords, 

39 Figure 47 is a figure showing example of dip-4 encoding (odd parity), 

40 Figure 48 is a figure showing example of a port calendar, 

41 Figure 49 is a figure showing fifo status state machine, 

42 Figure 50 is a figure showing example of dip-2 encoding (odd parity), 

43 Figure 51 is a figure showing summary of internal memories, 

44 Figure 52 is a figure showing interface ports^ 

45 Figure 53 is a figure showing register description, 

46 Figure 54 is a figure showing outgoing spi-4 interface, 
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1 Figure 55 is a figure showing training sequence. 

2 Figure 56 is a figure showing tx state machine transition table. 

3 Figure 57 is a figure showing extent of dip-4 codewords^ 

4 Figure 58 is a figure showing example of dip-4 encoding (odd parity)^ 

5 Figure 59 is a figure showing example of transfer data byte ordering, 

6 Figure 60 is a figure showing tx state machine block, 

7 Figure 61 is a figure showing port calendar, 

8 Figure 62 is a figure showing status interface to database, 

9 Figure 63 skew detection, 

10 Figure 64 is a figure showing seg_table - 64 x 26 bits internal dual-port SRAM^ 

1 1 Figure 65 is a figure showing interface, 

12 Figure 66 is a figure showing timing diagram-reassembly port empty^ 

13 Figure 67 is a figure showing timing diagram-back-to-back port transfer, 

14 Figure 68 is a figure showing register description, 

15 Figure 69 is a figure showing software interface registers^ 

16 Figure 70 is a figure showing read outgoing spi start-up parameters, 

17 Figure 71 is a figure showing write outgoing spi start-up parameters, 

18 Figure 72 is a figure showing read outgoing spi start-up parameters, 

19 Figure 73 is a figure showing write outgoing spi start-up parameters, 

20 Figure 74 is a figure showing read outgoing spi start-up parameters, 

21 Figure 75 is a figure showing write outgoing spi start-up parameters, 

22 Figure 76 is a figure showing write port parameters after initialization, 

23 Figure 77 is a figure showing read port parameters after initialization, 

24 Figure 78 is a figure showing read port full parameters, 

25 Figure 79 is a figure showing write port full parameters^ 

26 Figure 80 is a figure showing read control status register, 

27 Figure 81 is a figure showing write spio_tstmux_sel, 

28 Figure 82 is a figure showing read spio_tstmux_sel, 

29 Figure 83 is a figure showing lookup engine and neighboring blocks, 

30 Figure 84 is a figure showing top level block diagram of the lookup engine, 

31 Figure 85 is a figure showing data dependency on sop cell followed none sop cell on the 

32 same port, 

33 Figure 86 is a figure showing header selection, 

34 Figure 87 is a figure showing label hashing & record retrieval, 

35 Figure 88 is a figure showing port parameters, it is 64 entries, 

36 Figure 89 is a figure showing encap table structure, it is 8 entries, 

37 Figure 90 is a figure showing port type, 

38 Figure 91 is a figure showing record memory, it is 2meg entries per memory, 

39 Figure 92 is a figure showing parameter width. 

40 Figure 93 is a figure showing external interface signals^ 

41 Figure 94 is a figure showing external interface signal names and direction^ 

42 Figure 95 is a figure showing global signals, 

43 Figure 96 is a figure showing CPU interface to look up engine, 

44 Figure 97 is a figure showing memory size, 

45 Figure 98 is a figure showing memory size bits, 

46 Figure 99 is a figure showing memory chip select. 



4 



AZA-OOl/2001-POO, 




PATENT 



1 Figure 100 is a figure showing memory chip select values^ 

2 Figure 101 is a figure showing address space^ 

3 Figure 102 is a figure showing registers addresses. 

4 Figure 103 is a figure showing nop command. 

5 Figure 104 is a figure showing read lut memoryl command. 

6 Figure 105 is a figure showing write lut memoryl command. 

7 Figure 106 is a figure showing read lut memory2 command. 

8 Figure 107 is a figure showing write lut memory2 command^ 

9 Figure 108 is a figure showing read port memory command^ 

10 Figure 109 is a figure showing write port memory conmiand^ 

1 1 Figure 1 10 is a figure showing setup connection conmiand for key command. 

12 Figure 1 1 1 is a figure showing teardown connection command for key command^ 

13 Figure 1 12 is a figure showing get hash FID command^ 

14 Figure 1 13 is a figure showing get hash FID response^ 

15 Figure 1 14 is a figure showing read cam memoryl command. 

16 Figure 1 15 is a figure showing write cam memory L 

17 Figure 1 16 is a figure showing read cam memory2^ 

18 Figure 1 17 is a figure showing write cam memory2^ 

19 Figure 1 18 is a figure showing init all memories. 

20 Figure 1 19 is a figure showing init memory 1 & cam L 

21 Figure 120 is a figure showing init memory 2 & cam 2. 

22 Figure 121 is a figure showing ATM types. 

23 Figure 122 is a figure showing ATM MPLS tag location & format^ 

24 Figure 123 is a figure showing MPLS format. 

25 Figure 124 is a figure showing lookup types^ 

26 Figure 125 is a figure showing header format^ 

27 Figure 126 is a figure showing data position within the header format. 

28 Figure 127 is a figure showing header substitution^ 

29 Figure 128 is a figure showing special header structure. 

30 Figure 129 is a figure showing encap table structure. 

31 Figure 130 is a figure showing the encapsulation header replacement in 12 case. 

32 Figure 131 is a figure showing hashing and memory accesses^ 

33 Figure 132 is a figure showing hash timing and lookup operations. 

34 Figure 133 is a figure showing types of segmentation and applications for various traffic 

35 types. 

36 Figure 134 is a figure showing segmentation engine and neighboring blocks^ 

37 Figure 135 is a figure showing spi-4 data flow^ 

38 Figure 136 is a figure showing type 1 segmentation. 

39 Figure 137 is a figure showing type 2 segmentation. 

40 Figure 138 is a figure showing type 3 segmentation^ 

41 Figure 139 is a figure showing type 4 segmentation. 

42 Figure 140 is a figure showing status cell access flow. 

43 Figure 141 is a figure showing CPU access flow^ 

44 Figure 142 is a figure showing top-level block diagram for segmentation engine^ 

45 Figure 143 is a figure showing error conditions^ 

46 Figure 144 is a figure showing summary of internal memories^ 
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1 Figure 145 is a figure showing seg^table - 65 x 39 bits internal dual-port SRAM 

2 Figure 146 is a figure showing free_buffer - 80 x 7 bits internal SRAM. 

3 Figure 147 is a figure showing q_fifo - 80 x 70 bits internal SRAM, 

4 Figure 148 is a figure showing data SRAM - 640 x 64 bits internal SRAM, 

5 Figure 149 is a figure showing crc_table - 65 x 32 bits internal dual-port SRAM, 

6 Figure 150 is a figure showing statistic SRAM - 65 x 80 bits internal SRAM, 

7 Figure 151 is a figure showing system, 

8 Figure 152 is a figure showing segmentation O lookup, 

9 Figure 153 is a figure showing segmentation O memory manager, 

10 Figure 154 is a figure showing segmentation <^^^ PF(i 

11 Figure 155 is a figure showing segmentation O sch, 

12 Figure 156 is a figure showing segmentation O CPU interface, 

13 Figure 157 is a figure showing timing diagrams, 

14 Figure 158 is a figure showing memory access, 

15 Figure 159 is a figure showing registers, 

16 Figure 160 is a figure showing surrimary of commands, 

17 Figure 161 is a figure showing CPU inject packet, 

18 Figure 162 is a figure showing read seg fbl memory, 

19 Figure 163 is a figure showing write seg fbl memory, 

20 Figure 164 is a figure showing read seg crc memory, 

21 Figure 165 is a figure showing write seg crc memory, 

22 Figure 166 is a figure showing read seg statistic memory, 

23 Figure 167 is a figure showing write seg statistic memory, 

24 Figure 168 is a figure showing read seg queue memory, 

25 Figure 169 is a figure showing write seg queue memory, 

26 Figure 170 is a figure showing read seg data memory, 

27 Figure 171 is a figure showing write seg data memory, 

28 Figure 172 is a figure showing read seg tbl memory, 

29 Figure 173 is a figure showing write seg tbl memory, 

30 Figure 174 is a figure showing memory manager block diagram, 

3 1 Figure 175 is a figure showing internal memory manager block diagram, 

32 Figure 176 is a figure showing multicast and BID/imBED release, 

33 Figure 177 is a figure showing enqueue process in the internal memory manager, 

34 Figure 178 is a figure showing dequeue process in the internal memory manager, 

35 Figure 179 is a figure showing SDRAM manager block diagram, 

36 Figure 180 is a figure showing memory controller configuration, 

37 Figure 181 is a figure showing summary of memories, 

38 Figure 182 is a figure showing summary of CAM. 

39 Figure 183 is a figure showing segmentation engine, 

40 Figure 184 is a figure showing per flow queue engine, 

41 Figure 185 is a figure showing DDR-SDSRAM, 

42 Figure 186 is a figure showing reassembly, 

43 Figure 187 is a figure showing CPU interface, 

44 Figure 188 is a figure showing global sync, 

45 Figure 189 is a figure showing memory size is a figure showing 64 mbytes, 

46 Figure 190 is a figure showing memory size is a figure showing 256 mbytes. 
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1 Figure 191 is a figure showing register description. 

2 Figure 192 is a figure showing CPU_wr_SDRAM, 

3 Figure 193 is a figure showing CPU_rd_SDRAM^ 

4 Figure 194 is a figure showing CPU_wr_ fiml. 

5 Figure 195 is a figure showing CPU_rd_fimL 

6 Figure 196 is a figure showing CPU_wr_cam. 

7 Figure 197 is a figure showing CPU_rd_cam. 

8 Figure 198 is a figure showing CPU_init_mm. 

9 Figure 199 is a figure showing inputs from segmentation^ 

10 Figure 200 is a figure showing interface with SDRAM bw optimizer is a figure showing 

1 1 SWM requests to write to SDRAM. 

12 Figure 201 is a figure showing interface to SDRAM bw optimizer is a figure showing 

13 write to SDRAM. 

14 Figure 202 is a figure showing interface with per flow queue is a figure showing receive 

15 de-queue from PFQ, 

16 Figure 203 is a figure showing interface with SDRAM bw optimizer is a figure showing 

17 SRM requests to read from SDRAM. 

18 Figure 204 is a figure showing interface with SDRAM bw optimizer is a figure showing 

19 SRM read from SDRAM^ 

20 Figure 205 is a figure showing interface with reassembly. 

21 Figure 206 is a figure showing applications for various traffic types. 

22 Figure 207 is a figure showing reassembly engine and neighboring blocks. 

23 Figure 208 is a figure showing type 1 reassembly (ingress memory cells => switch cells^ 

24 Figure 209 is a figure showing type 1 reassembly (aal5 memory cells => switch cells)^ 

25 Figure 210 is a figure showing type 1 reassembly (aal5 like memory cells => switch 

26 cells), 

27 Figure 21 1 is a figure showing type 2 reassembly (52 byte ATM cells => packet), 

28 Figure 212 is a figure showing type 3 reassembly (48 byte ATM cells => packet), 

29 Figure 213 is a figure showing type 4 reassembly (switch cells => packet), 

30 Figure 214 is a figure showing type 5 reassembly (52 byte ATM cells => ATM cells with 

31 translation), 

32 Figure 215 is a figure showing type 6 reassembly (48 byte ATM cells => packet), 

33 Figure 216 is a figure showing type 7 reassembly (48 byte cells => AAL-5 cells), 

34 Figure 217 is a figure showing type 8 reassembly (switch cells => packet), 

35 Figure 218 is a figure showing top level block diagram for the reassembly engine, 

36 Figure 2 1 9 is a figure showing free buffer list block diagram, 

37 Figure 220 is a figure showing crc_gen block diagram, 

38 Figure 22 1 is a figure showing ATM MPLS tag location & format, 

39 Figure 222 is a figure showing MPLS format, 

40 Figure 223 is a figure showing ATM types, 

41 Figure 224 is a figure showing seg_table - 64 x 18 bits internal single-port SRAM, 

42 Figure 225 is a figure showing output_control_memory - 128 x 18 bit internal SRAM, 

43 Figure 226 is a figure showing output_port_queue - 64 x 18 bit internal SRAM, 

44 Figure 227 is a figure showing free_buffer - 128 x 7 bit internal SRAM, 

45 Figure 228 is a figure showing data_SRAM - 1024 x 64 bit internal dual port SRAM, 

46 Figure 229 is a figure showing header_SRAM - 256 x 64 bit internal dual port SRAM, 
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1 Figure 230 is a figure showing CPU_data_SRAM - 36 x 64 bit internal dual port SRAM^ 

2 Figure 231 is a figure showing crc_table - 64 x 32 bits internal dual-port SRAM. 

3 Figure 232 is a figure showing statistic SRAM - 64 x 1 12 bit internal SRAM, 

4 Figure 233 is a figure showing look-up SRAM - 4m x 36 bit external SRAM, 

5 Figure 234 is a figure showing flow header format, 

6 Figure 235 is a figure showing control word formats, 

7 Figure 236 is a figure showing Maximus header format, 

8 Figure 237 is a figure showing ATM header format. 

9 Figure 238 is a figure showing MPLS format, 

10 Figure 239 is a figure showing aal5 trailer format, 

1 1 Figure 240 is a figure showing system, 

12 Figure 241 is a figure showing reassembly « memory manager. 

13 Figure 242 is a figure showing reassembly O SPL 

14 Figure 243 is a figure showing reassembly <^CPU, 

15 Figure 244 is a figure showing internal memory manager to reassembly, 

16 Figure 245 is a figure showing reassembly to spi out - port empty and back to back, 

17 Figure 246 is a figure showing timing diagram-back-to-back port transfer. 

1 8 Figure 247 is a figure showing memories, 

19 Figure 248 is a figure showing error conditions, 

20 Figure 249 is a figure showing registers, 

21 Figure 250 is a figure showing commands, 

22 Figure 25 1 is a figure showing read external ras memory, 

23 Figure 252 is a figure showing write external ras memory, 

24 Figure 253 is a figure showing flow control format. 

25 Figure 254 is a figure showing read internal data memory, 

26 Figure 255 is a figure showing write internal data memory, 

27 Figure 256 is a figure showing read four internal memories (partial crcl, partial crc2, 

28 reassembly table, output queue), 

29 Figure 257 is a figure showing write four internal memories (partial crcl, partial crc2, 

30 reassembly table, output queue), 

3 1 Figure 258 is a figure showing read port statistics. 

32 Figure 259 is a figure showing write port statistics, 

33 Figure 260 is a figure showing read CPU data memory, 

34 Figure 261 is a figure showing write CPU data memory, 

35 Figure 262 is a figure showing read 2 internal memories (free buffer list (fbl), output 

36 control), 

37 Figure 263 is a figure showing write 2 internal memories (free buffer list (fbl), output 

38 control), 

39 Figure 264 is a figure showing read internal header data memory, 

40 Figure 265 is a figure showing write internal header data memory, 

41 Figure 266 is a figure showing get CPU cell command, 

42 Figure 267 is a figure showing initialize free buffer list and output port queue memories 

43 command, 

44 Figure 268 is a figure showing summary of lookup engine block tags for various traffic 

45 types^ 

46 Figure 269 is a figure showing ATM header format. 
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1 Figure 270 is a figure showing MPLS label format 

2 Figure 271 is a figure showing ethemet frame format (IEEE 802.3 format), 

3 Figure 272 is a figure showing IP (internet protocol) header 

4 Figure 273 is a figure showing frame relay (fr) layer 2 frame format (shown with IP 

5 payoload). 

6 Figure 274 is a figure showing PPP full frame format for unnumbered mode operation, 

7 Figure 275 is a figure showing flow control entry format, 

8 Figure 276 is a figure showing tm to fabric flow control summary, 

9 Figure 277 is a figure showing fabric to tm control summary^ 

10 Figure 278 is a figure showing internal block diagram, 

1 1 Figure 279 is a figure showing graphic representation of an enqueue and dequeue FID. 

12 Figure 280 is a figure showing operation sequence table, 

13 Figure 281 is a figure showing enqueue block diagram, 

14 Figure 282 is a figure showing dequeue block digram. 

15 Figure 283 is a figure showing free buffer manager block diagram, 

16 Figure 284 is a figure showing random number generator, 

17 Figure 285 is a figure showing memory size, 

18 Figure 286 is a figure showing memory chip select, 

19 Figure 287 is a figure showing data dependency, 

20 Figure 288 is a figure showing RED memory^ 

21 Figure 289 is a figure showing RED memory byte enable, 

22 Figure 290 is a figure showing statistics counter location 0, 

23 Figure 291 is a figure showing statistics counter location 1, 

24 Figure 292 is a figure showing statistics counter byte enable, 

25 Figure 293 is a figure showing dequeue memory, 

26 Figure 294 is a figure showing dequeue memory byte enable, 

27 Figure 295 is a figure showing FID enqueue location 0, 

28 Figure 296 is a figure showing FID enqueue memory location 1, 

29 Figure 297 is a figure showing FID enqueue memory byte enable, 

30 Figure 298 is a figure showing BID external data structure, 

31 Figure 299 is a figure showing BID external data structure byte enable, 

32 Figure 300 is a figure showing dequeue memory, 

33 Figure 301 is a figure showing dequeue memory byte enable, 

34 Figure 302 is a figure showing multicasting FID enqueue memory location 0, 

35 Figure 303 is a figure showing multicasting FID enqueue memory location 1, 

36 Figure 304 is a figure showing multicasting FID enqueue memory byte enable, 

37 Figure 305 is a figure showing multicasting FID root memory, 

38 Figure 306 is a figure showing FID dequeue leaf tunneling internal data structure, 

39 Figure 307 is a figure showing FID enqueue leaf tunneling internal data structure location 

40 0. 

41 Figure 308 is a figure showing FED enqueue leaf tunneling internal data structure location 

42 1, 

43 Figure 309 is a figure showing FID enqueue leaf tunneling internal data structure byte 

44 enable, 

45 Figure 310 is a figure showing tunneling FID root data structure, 

46 Figure 31 1 is a figure showing interface signals. 
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1 Figure 3 12 is a figure showing CPU interface to PFQ^ 

2 Figure 313 is a figure showing segmentation interface diagram. 

3 Figure 314 is a figure showing segmentation interface. 

4 Figure 3 15 is a figure showing DBS & CBWFQ interface, 

5 Figure 3 16 is a figure showing DBS interface, 

6 Figure 317 is a figure showing memory and PFQ interface^ 

7 Figure 318 is a figure showing memory and PFQ interface signals^ 

8 Figure 3 19 is a figure showing memory manager commands to per flow queue, 

9 Figure 320 is a figure showing per flow queue commands to memory manager 

10 Figure 321 is a figure showing mem to PFQ conmiands, 

1 1 Figure 322 is a figure showing PFQ to mem commands, 

12 Figure 323 is a figure showing memory manager and the per flow queue interface, 

13 Figure 324 is a figure showing PFQ and memory manager PFQ command structure, 

14 Figure 325 is a figure showing PFQ to segmentation timing diagram, 

15 Figure 326 is a figure showing address space, 

16 Figure 327 is a figure showing registers space, 

17 Figure 328 is a figure showing nop command, 

18 Figure 329 is a figure showing read FID enqueue memory command, 

19 Figure 330 is a figure showing write FID enqueue memory command, 

20 Figure 33 1 is a figure showing read FID dequeue memory command, 

21 Figure 332 is a figure showing write FID dequeue memory command, 

22 Figure 333 is a figure showing read BID memory command, 

23 Figure 334 is a figure showing write BID memory command. 

24 Figure 335 is a figure showing read stat memory command, 

25 Figure 336 is a figure showing write stat memory command, 

26 Figure 337 is a figure showing setup connection command, 

27 Figure 338 is a figure showing tear-down connection command, 

28 Figure 339 is a figure showing init FID memories, 

29 Figure 340 is a figure showing init BID memory, 

30 Figure 341 is a figure showing timing that includes the FID enqueue, dequeue, BID and 

3 1 statistics memories. 

32 Figure 342 is a figure showing enqueue memory, 

33 Figure 343 is a figure showing RED memory, 

34 Figure 345 is a figure showing dequeue memory, 

35 Figure 346 is a figure showing BID memory, 

36 Figure 347 is a figure showing statistics memory, 

37 Figure 348 is a figure showing basic block diagram, 

38 Figure 349 is a figure showing interface to/from shaper, 

39 Figure 350 is a figure showing general signals, 

40 Figure 351 is a figure showing status interface from output spi, 

41 Figure 352 is a figure showing flowid memory, 

42 Figure 353 is a figure showing pkt memory, 

43 Figure 354 is a figure showing port calendar memory, 

44 Figure 355 is a figure showing shp_strict memoy, 

45 Figure 356 is a figure showing shp_out_port memory, 

46 Figure 357 is a figure showing register descriptions. 
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1 Figure 358 is a figure showing read external FID memory^ 

2 Figure 359 is a figure showing write external FID memory. 

3 Figure 360 is a figure showing setup FID connection. 

4 Figure 361 is a figure showing read external pkt memory. 

5 Figure 362 is a figure showing write external pkt memory^ 

6 Figure 363 is a figure showing read port calendar memory. 

7 Figure 364 is a figure showing write port calendar memory. 

8 Figure 365 is a figure showing read shp strict memory. 

9 Figure 366 is a figure showing write shp strict memory. 

10 Figure 367 is a figure showing read shp out memory^ 

1 1 Figure 368 is a figure showing write shp memory^ 

12 Figure 369 is a figure showing init shp_out_port memory^ 

13 Figure 370 is a figure showing init shp_strict memory^ 

14 Figure 371 is a figure showing timing diagram. 

15 Figure 372 is a figure showing shaper 

16 Figure 373 is a figure showing scheduler 

17 Figure 374 is a figure showing PFQ^ 

18 Figure 375 is a figure showing ingress/egress mode of operations. 

19 Figure 376 is a figure showing interface to/from scheduler 

20 Figure 377 is a figure showing scheduler's interface. 

21 Figure 378 is a figure showing signal description^ 

22 Figure 379 is a figure showing simple round-robin algorithm. 

23 Figure 380 is a figure showing weighted-round-robin scheduling, 

24 Figure 381 is a figure showing creating smaller bursts with factor 

25 Figure 382 is a figure showing strict priority, 

26 Figure 383 is a figure showing links between all levels of scheduling, 

27 Figure 384 is a figure showing links between memories during input phase (the ingress 

28 mode), 

29 Figure 385 is a figure showing links between memories during output phase (the egress 

30 mode), 

3 1 Figure 386 is a figure showing links between memories during output phase (the ingress 

32 mode), 

33 Figure 387 is a figure showing external memory - next flowid, 

34 Figure 388 is a figure showing internal memory - qos parameters, 

35 Figure 389 is a figure showing internal memory - qos descriptors, 

36 Figure 390 is a figure showing internal memory - port parameters, 

37 Figure 391 is a figure showing internal memory - port descriptor. 

38 Figure 392 is a figure showing internal memory - next port, 

39 Figure 393 is a figure showing registers, 

40 Figure 394 is a figure showing read FID next memory, 

41 Figure 395 is a figure showing write FH) next memory, 

42 Figure 396 is a figure showing read qos parameters, 

43 Figure 397 is a figure showing write qos parameters, 

44 Figure 398 is a figure showing read qos descriptor, 

45 Figure 399 is a figure showing write qos descriptor, 

46 Figure 400 is a figure showing read port parameters. 
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1 Figure 401 is a figure showing write port parameters. 

2 Figure 402 is a figure showing read port descriptor. 

3 Figure 403 is a figure showing write port descriptor. 

4 Figure 404 is a figure showing read port next. 

5 Figure 405 is a figure showing write port next. 

6 Figure 406 is a figure showing init qos. 

7 Figure 407 is a figure showing init port^ 

8 Figure 408 is a figure showing global timing for input/output stages, 

9 Figure 409 is a figure showing the input phase pipelines^ 

10 Figure 410 is a figure showing the output phase pipelines. 

1 1 Figure 41 1 is a figure showing shaper/meter block interface. 

12 Figure 412 is a figure showing traffic shaper/meter interface for dual-leaky-bucket, 

13 Figure 413 is a figure showing traffic shaper/meter interface for single-leaky-bucket, 

14 Figure 414 is a figure showing general signals, 

15 Figure 415 is a figure showing progress of a leaky bucket algorithm, 

16 Figure 416 is a figure showing links in the timing wheel, 

17 Figure 417 is a figure showing example for shaping FID in the timing wheel, 

18 Figure 418 is a figure showing calculating future time slot, 

19 Figure 419 is a figure showing the usage of peak and sustained rates, 

20 Figure 420 is a figure showing Rin and K factor, 

21 Figure 421 is a figure showing credit calculation diagram, 

22 Figure 422 is a figure showing calculation of threshold meter, 

23 Figure 423 is a figure showing memory description^ 

24 Figure 424 is a figure showing shaper rate ED for shaper mode, 

25 Figure 425 is a figure showing meter threshold for meter mode, 

26 Figure 426 is a figure showing shaper slot memory, 

27 Figure 427 is a figure showing FID meml for shaper, 

28 Figure 428 is a figure showing FID meml for meter, 

29 Figure 429 is a figure showing FID mem 2 for shaper, 

30 Figure 430 is a figure showing FID mem2 for meter. 

3 1 Figure 43 1 is a figure showing register description, 

32 Figure 432 is a figure showing read FTDl memory, 

33 Figure 433 is a figure showing write FIDl memory when using shaper, 

34 Figure 434 is a figure showing read FTD2 memory, 

35 Figure 435 is a figure showing write FID2 memory, 

36 Figure 436 is a figure showing read shp_slot memory, 

37 Figure 437 is a figure showing write shp_slot memory, 

38 Figure 438 is a figure showing read ratejd memory, 

39 Figure 439 is a figure showing write ratejd memory, 

40 Figure 440 is a figure showing setup connection command, 

41 Figure 441 is a figure showing read FIDl memory, 

42 Figure 442 is a figure showing write FIDl memory, 

43 Figure 443 is a figure showing read FID2 memory, 

44 Figure 444 is a figure showing write FID2 memory, 

45 Figure 445 is a figure showing read threshold (internal) memory, 

46 Figure 446 is a figure showing write threshold (internal) memory. 



12 



AZA-OOl/2001-POO 




PATENT 



1 Figure 447 is a figure showing read modwrite meter. 

2 Figure 448 is a figure showing setup connection command. 

3 Figure 449 is a figure showing timing diagrams for meter. 

4 Figure 450 is a figure showing block addresses. 

5 Figure 451 is a figure showing read access and timings definitions. 

6 Figure 452 is a figure showing read access timing and definitions table. 

7 Figure 453 is a figure showing write access and timings definitions. 

8 Figure 454 is a figure showing write access and timings definition table^ 

9 Figure 455 is a figure showing interface from the CPU IF to the blocks^ 

10 Figure 456 is a figure showing configuration modes. 

1 1 Figure 457 is a figure showing global sync. 

12 Figure 458 is a figure showing slot_count and CPU_port signals, 

13 Figure 459 is a figure showing multiplexing the test busses from the blocks, 

14 Figure 460 is a figure showing interface of the CPU IF block, 

15 Figure 461 is a figure showing internal implementation for locking the current CPU 

16 access parameters, 

17 Figure 462 is a figure showing register description, 

18 Figure 463 is a figure showing CPU interface signals, 

19 Figure 464 is a figure showing block addresses, 

20 Figure 465 is a figure showing CPUif block register, 

2 1 Figure 466 is a figure showing PFQ block register, 

22 Figure 467 is a figure showing read FID enqueue memory (72 bits) - up to 2m locations, 

23 Figure 468 is a figure showing write FID enqueue memory (72 bits) - up to 2m locations, 

24 Figure 469 is a figure showing read FID dequeue memory (36 bits)- up to Im locations, 

25 Figure 470 is a figure showing write FID dequeue memory (36 bits) - up to Im 

26 locations, 

27 Figure 471 is a figure showing read BID dequeue memory (36 bits) - up to 16m 

28 locations, 

29 Figure 472 is a figure showing write BID memory (36 bits) - up to 16m locations, 

30 Figure 473 is a figure showing read stat memory (72 bits) - up to 2m locations, 

31 Figure 474 is a figure showing write stat memory (72 bits) - up to 2m locations, 

32 Figure 475 is a figure showing setup connection command for FID, 

33 Figure 476 is a figure showing tear-down connection command for FID, 

34 Figure 477 is a figure showing init FID memories, 

35 Figure 478 is a figure showing init BID memory, 

36 Figure 479 is a figure showing database block registers, 

37 Figure 480 is a figure showing read external FID memory, 

38 Figure 481 is a figure showing write external FID memory, 

39 Figure 482 is a figure showing setup FID connection, 

40 Figure 483 is a figure showing read external pkt memory. 

41 Figure 484 is a figure showing write external pkt memory, 

42 Figure 485 is a figure showing read port calendar memory, 

43 Figure 486 is a figure showing write port calendar memory, 

44 Figure 487 is a figure showing read shp strict memory, 

45 Figure 488 is a figure showing write shp strict memory, 

46 Figure 489 is a figure showing read shp out memory. 
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1 Figure 490 is a figure showing write shp out memory. 

2 Figure 491 is a figure showing init shp_out_port memory. 

3 Figure 492 is a figure showing init shp_strict memory. 

4 Figure 493 is a figure showing shaper/meter block registers. 

5 Figure 494 is a figure showing read FIDl memory. 

6 Figure 495 is a figure showing write FDDl memory when using shaper 

7 Figure 496 is a figure showing read FID2 memory. 

8 Figure 497 is a figure showing write FID2 memory^ 

9 Figure 498 is a figure showing read shp_slot memory. 

10 Figure 499 is a figure showing write shp_slot memory. 

1 1 Figure 500 is a figure showing read ratejd memory^ 

12 Figure 501 is a figure showing write ratejd memory^ 

13 Figure 502 is a figure showing setup connection command^ 

14 Figure 503 is a figure showing read FIDl memory. 

15 Figure 504 is a figure showing write FIDl memory. 

16 Figure 505 is a figure showing read FID2 memory^ 

17 Figure 506 is a figure showing write FID2 memory^ 

18 Figure 507 is a figure showing read threshold (internal) memory^ 

19 Figure 508 is a figure showing write threshold (internal) memory^ 

20 Figure 509 is a figure showing readmod write meter. 

21 Figure 5 10 is a figure showing setup connection command^ 

22 Figure 5 1 1 is a figure showing sceduler block registers^ 

23 Figure 5 12 is a figure showing read FK) next memory^ 

24 Figure 5 13 is a figure showing write FID next memory. 

25 Figure 5 14 is a figure showing read qos parameters. 

26 Figure 5 15 is a figure showing write qos parameters. 

27 Figure 5 16 is a figure showing read qos descriptor. 

28 Figure 517 is a figure showing write qos descriptor. 

29 Figure 5 18 is a figure showing read port parameters^ 

30 Figure 5 19 is a figure showing write port parameters. 

3 1 Figure 520 is a figure showing read port descriptor. 

32 Figure 521 is a figure showing write port descriptor^ 

33 Figure 522 is a figure showing read port next^ 

34 Figure 523 is a figure showing write port next^ 

35 Figure 524 is a figure showing init qos^ 

36 Figure 525 is a figure showing init port^ 

37 Figure 526 is a figure showing parameters per flowid^ 

38 Figure 527 is a figure showing mx2 external memory. 
39 

40 DETAILED DESCRIPTION OF THE PREFERRED EMB ODIMENTS 

41 The present invention will now be described in detail with reference to a few 

42 preferred embodiments thereof as illustrated in the accompanying drawings. In the 

43 following description, numerous specific details are set forth in order to provide a 

44 thorough understanding of the present invention. It will be apparent, however, to one 

45 skilled in the art, that the present invention may be practiced without some or all of these 
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1 specific details. In other instances, well known process steps and/or structures have not 

2 been described in detail in order to not unnecessarily obscure the present invention. 
3 

4 System Overview 

5 The Maximus implements a multi-service approach with dynamic bandwidth 

6 allocation among the various flows carrying the different types of traffic. The Maximus 

7 device is designed such that it can be used in different applications that fit the needs of 

8 various system architectures. Two of the more popular applications are traffic manager 

9 and SAR solutions for OC-192 line cards in routing and switching systems. In one 

10 embodiment, the Maximus device is implemented in the form of two integrated circuit 

1 1 chips, designated the MX-1 and MX-2. In another embodiment, the entirety of the 

12 Maximus device is implemented on one integrated circuit chip (the circuitry of the MX-1 

13 and MX-2 integrated circuits are integrated onto a single integrated circuit chip). 

14 Figure 1 depicts an exemplary system level overview, illustrating how the 

15 Maximus device may enable OC-192 line cards in a router or switch system to receive 

16 different types of traffic, to dynamically allocate bandwidth among the various flows, to 

17 adapt the incoming traffic to the requirements of the cell-based or packet-based switch 

18 fabric, and to convert among the different traffic types for output. The term "router" as it 

19 is used in this document includes, but is not limited to, packet routers (for example, IP 

20 packet routers) and cell switches (for example, what are commonly called ATM 

21 switches). 

22 In Figure 1, each line card can receive both ATM and packet traffic coming in on 

23 different flows across a fiber optic connection. The Maximus device on each line card, 

24 acting as an ingress device interfacing with a framer device, can receive, buffer and adapt 

25 the incoming traffic to the switching requirements of either the cell or packet switch 

26 fabric. On the egress side, the traffic can be sent out to the framer device either in the 

27 same traffic type (e.g., ATM in, ATM out) or in a different traffic type (e.g., ATM in, 

28 packet out or packet in, ATM out). With regard to dynamic bandwidth allocation, the 

29 Maximus offers extensive QoS capabilities to shape and schedule the flows as desired 

30 according to their QoS priorities. 

31 Figures 2 and 3 below are two exemplary linecard views, illustrating how the 

32 Maximus device may be employed on an OC-192 line card operating with either a cell- 

33 based switch fabric (Figure 2) or a packet-based switch fabric (Figure 3). In these 

34 Figures, the traffic management or SAR solution is accomplished using two separate 

35 chips. 

36 In Figure 2, a traffic management solution in conjunction with a cell-based switch 

37 fabric is illustrated. Four OC-48 optics couple with a quad OC-48 Framer to achieve an 

38 aggregate OC-192 data rate. This is not a requirement, however, as the Maximus Ingress 

39 Traffic Manger and the Maximus Egress Traffic Manager can also directly couple with 

40 an OC-192 framer to receive data at 10 Gbps. The Maximus supports 64 incoming or 

41 outgoing ports and up to 1 million separate flows in each direction. The interface 

42 between the Framer and Maximus Traffic Manager chips is SPI-4 (System Packet 

43 Interface-4), phase 2 (i.e., LVDS running at 400 MHz). Data storage and buffering 

44 employs SDRAMs, while pointers and tables are stored in SSRAMs. The Maximus 

45 Traffic Manager chips interface with a cell-based switch fabric in the example of Figure 

46 2. 
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1 In the example of Figure 3, the Maximus Ingress Traffic Manger and the 

2 Maximus Egress Traffic Manager are employed in the SAR configuration. For example, 

3 the Maximus may receive ATM traffic, reassembles the ATM cells into packets along the 

4 ingress path to switch through a packet-based switch fabric. On the egress path, the 

5 packets may be segmented back into ATM cells for output to the quad OC-48 framers. 

6 However, due to the multi-service capabilities offered, packets may also be received and 

7 switched through the packet-based switch fabric. The data in these packets (whether 

8 received as ATM cells or packets) may be output to the Framer as either ATM cells or 

9 packets. 
10 

11 Applications 

12 Figure 4 shows the various applications, along with their combinations of 

1 3 incoming and outgoing traffic types. 

14 Application types 0-3 in Figure 4 represent the ingress traffic manager 

1 5 applications. Application types 8- 1 1 , on the other hand, represent the egress traffic 

16 manager applications. These ingress and egress traffic management application types (0- 

17 3 and 8-11) are graphically represented in Figure 5. Note that traffic management is 

18 offered not only for packets but also for ATM cells and AAL5 cells in a single pair of 

19 Maximus chips (ingress/egress pair). This reflects the multi-service approach of the 

20 Maximus. 

21 Figure 5 A shows uses of the Maximus integrated circuit chip with a cell-based 

22 switch fabric. The dashed box labeled "MX- 1" indicates the contents of one integrated 

23 circuit. Certain other parts of the integrated circuit (for example, the shaper block, the 

24 scheduler block, and the memory management block) are omitted from Figure 5 A in 

25 order to clarify the illustration. These other parts are described in further detail below in 

26 the later part of this patent document. The darkening of a block in Figure 5A indicates 

27 that the block is substantially disabled in that it is not performing its primary function. 

28 Blocks that are not darkened in Figure 5A are enabled to perform their primary functions. 

29 Application types 4-6 and 12-14 represent the SAR application types, along with 

30 their combinations of incoming and outgoing traffic types. These SAR application types 

31 (4-6 and 12-14) are graphically represented in Figure 6. Note that these SAR application 

32 types can handle not only segmentation of packets and reassembly of cells but can also 

33 support ATM cell encapsulation and packet bypass (application types 4, 6, 12, and 14). 

34 Again, this reflects the intelligent multi-service approach of the Maximus. 

35 Figure 6A shows uses of the Maximus integrated circuit chip with a packet-based 

36 switch fabric. The dashed box labeled "MX-1" indicates the contents of one integrated 

37 circuit. Certain other parts of the integrated circuit (for example, the shaper block, the 

38 scheduler block, and the memory management block) are omitted from Figure 6A in 

39 order to clarify the illustration. These other parts are described in further detail below in 

40 the later part of this patent document. The darkening of a block in Figure 6A indicates 

41 that the block is substantially disabled in that it is not performing its primary function. 

42 Blocks that are not darkened in Figure 6A are enabled to perform their primary functions. 

43 The data formats for the various application types set forth in Figures 5, 5A, 6 and 

44 6A are discussed below, with reference to the data format drawings of Figures 7-20. 
45 
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1 Data Formats 

2 Figure 7 illustrates the data format for Ingress Application Type 0 (ATM => 

3 ATM). In this application, the incoming data via the incoming SPI interface is ATM 

4 cells (52 bytes, which includes 4 bytes of ATM header and 48 bytes of data). Four bytes 

5 of pad is added prior to sending the incoming data to the lookup block (since the 

6 Maximus internal data path is 64-bit or 8-byte wide). The segmentation block adds 8 

7 bytes of pad to allow the memory manager to pass the 64-byte memory cells to SDRAMs 

8 for storage. The reassembly block adds either 8 or 16 bytes of header to the 64-byte 

9 memory cell prior to sending to the cell-based switch (via the outgoing SPI block). The 

10 reader is invited to review egress Application type 8 (Figure 14) for the analogous 

1 1 process on the egress side, if the data is to be output to the Framer by the egress Maximus 

12 chip as ATM cells. Egress application type 8 is discussed later herein. 

13 Figure 8 illustrates the data format for Ingress Application Type 1 (ATM => 

14 MPLS packet). In this application, the incoming data via the incoming SPI interface is 

15 ATM cells (52 bytes, which includes 4 bytes of ATM header and 48 bytes of data). Four 

16 bytes of pad is added prior to sending the incoming data to the lookup block (since the 

17 Maximus internal data path is 64-bit or 8-byte wide). The segmentation block removes 

18 the 4-byte ATM header and adds 12 more bytes of pad to allow the memory manager to 

19 pass the 64-byte memory cells to SDRAM for storage. The reassembly block adds either 

20 8 or 16 bytes of header to the 64-byte memory cell prior to sending to the cell-based 

21 switch (via the outgoing SPI block). The reader is invited to review egress Application 

22 type 9 (Figure 15) for the analogous process on the egress side, if the data is to be output 

23 to the Framer by the egress Maximus chip as MPLS packets. Egress application type 9 is 

24 discussed later herein. 

25 Figure 9 illustrates the data format for Ingress Application Type 2 (MPLS packet 

26 =>ATM). In this application, the incoming data via the incoming SPI interface is MPLS 

27 packet (Nxl6 bytes burst). The packet data is passed onto the lookup engine. The 

28 segmentation block segments the packet data into 48-byte AAL5 cells and appends 16 

29 bytes of pad to allow the memory manager to pass the 64-byte memory cells to SDRAM 

30 for storage. The reassembly block adds either 8 or 16 bytes of switch header to the 64- 

3 1 byte memory cell prior to sending to the cell-based switch (via the outgoing SPI block). 

32 The reader is invited to review Application type 10 (Figure 16) for the analogous process 

33 on the egress side, if the data is to be output to the Framer by the egress Maximus chip as 

34 ATM cells. Egress application type 10 is discussed later herein. 

35 Figure 10 illustrates the data format for Ingress Application Type 3 (Packet => 

36 Packet). In this application, the incoming data via the incoming SPI interface is packet 

37 (Nxl6 bytes burst). The packet data is passed onto the lookup engine. The segmentation 

38 block segments the packet data into AAL5-like 64-byte cells. In this segmentation, CRC- 

39 32 is generated and an AAL5-like trailer is added to the end of the last 64-byte cell. The 

40 entire 64-byte AAL5-like cell is passed onto the memory manager for storage in 

41 SDRAM. The reassembly block adds either 8 or 16 bytes of switch header to the 64-byte 

42 memory cell prior to sending to the cell-based switch (via the outgoing SPI block). The 

43 reader is invited to review Application type 1 1 (Figure 17) for the analogous process on 

44 the egress side, if the data is to be output to the Framer by the egress Maximus chip as 

45 packets. Egress application type 1 1 is discussed later herein. 
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1 Figure 1 1 illustrates the data format for Ingress Application Type 4 (ATM 

2 Encapsulation). In this application, the incoming data via the incoming SPI interface is 

3 ATM cells (52 bytes, which includes 4 bytes of ATM header and 48 bytes of data). Four 

4 bytes of pad is added prior to sending the incoming data to the lookup block (since the 

5 Maximus internal data path is 64-bit or 8-byte wide). The segmentation block adds 8 

6 bytes of pad to allow the memory manager to pass the 64-byte cell to SDRAM for 

7 storage. The reassembly block removes 8 bytes of pad, adds either 8 or 16 bytes of 

8 switch header prior to sending the resultant 64-byte packet to the packet-based switch 

9 (via the outgoing SPI block). The reader is invited to review egress Application type 12 

10 (Figure 18) for the analogous process on the egress side, if the data is to be output to the 

1 1 Framer by the egress Maximus chip as ATM cells. Egress application type 12 is 

1 2 discussed later herein. 

13 Figure 12 illustrates the data format for Ingress Application Type 5 (Reassembly). 

14 In this application, the incoming data via the incoming SPI interface is ATM cells (52 

1 5 bytes, which includes 4 bytes of ATM header and 48 bytes of data). Four bytes of pad is 

16 added prior to sending the incoming data to the lookup block (since the Maximus internal 

17 data path is 64-bit or 8-byte wide). The lookup block removes the 4-byte ATM header 

1 8 and passes the 48-byte ATM data to the segmentation block, wherein 16 bytes of pad are 

19 added to allow the memory manager to pass the 64-byte cells to SDRAM for storage. 

20 The reassembly block maps the 48-byte ATM cells stored in memory as 64-byte cell into 

21 packets by stripping 16 bytes of pad from each memory cell and adding a switch packet 

22 header before sending the resultant packet to the packet-based switch via the outgoing 

23 SPI. The reader is invited to review egress Application type 13 (Figure 19) for the 

24 analogous process on the egress side, if the data is to be output to the Framer by the 

25 egress Maximus chip as ATM cells. Egress application type 13 is discussed later herein. 

26 Figure 13 illustrates the data format for Ingress Application Type 6 (Ingress 

27 Packet Bypass). In this application, the incoming data via the incoming SPI interface is a 

28 packet burst (Nxl6 bytes). The packet data is passed to the lookup block, which 

29 subsequently passes the packet data to the segmentation block. The segmentation block 

30 segments the packet data into AAL5-like 64-byte cells. In this segmentation, CRC-32 is 

3 1 generated and an AAL5-like trailer is added to the end of the last 64-byte cell. The entire 

32 64-byte AAL5-like cell is passed onto the memory manager for storage in SDRAM. The 

33 reassembly block maps the 64-byte AAL5-like memory cell into packets by removing the 

34 AAL5 encapsulation and may add 8 or 16 bytes of switch packet header. The 

35 reassembled packet, including the 8/16 bytes of switch packet header is passed on to the 

36 packet-based switch via the outgoing SPI. The reader is invited to review egress 

37 Application type 14 (Figure 20) for the analogous process on the egress side, if the data is 

38 to be output to the Framer by the egress Maximus chip as packets. Egress application 

39 type 14 is discussed later herein. 

40 Figure 14 illustrates the data format for Egress Application Type 8 (ATM 

41 =>ATM). In this application, the incoming data (from the switch) via the incoming SPI 

42 interface is a switch cell which comprises 8/16 bytes of switch header, 4 bytes of ATM 

43 header, 48 bytes of ATM data, and 12 bytes of pad. The switch header is removed by 

44 the lookup block, resulting in a 64-byte cell. The 64-byte cell is then passed through the 

45 segmentation block to the memory manager to be stored in SDRAM as a 64-byte memory 

46 cell. The reassembly block translates the ATM header, removes 8 bytes of pad (leaving 4 
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1 bytes of pad behind to facilitate sending the resultant 52-byte ATM cell + 4 bytes of pad 

2 to the outgoing SPI interface (since the Maximus internal data path is 64-bit or 8-byte 

3 wide). These 4 bytes of pad are later removed to allow the 52-byte ATM cell to be sent 

4 out to the Framer (via the outgoing SPI block). 

5 Figure 15 illustrates the data format for Egress Application Type 9 (ATM 

6 =>MPLS packet). In this application, the incoming data (from the switch) via the 

7 incoming SPI interface is a switch cell which comprises 8/16 bytes of switch header, 48 

8 bytes of AAL5 data, and 16 bytes of pad. The switch header is removed by the lookup 

9 block, resulting in a 64-byte cell. The 64-byte cell is then passed through the 

10 segmentation block to the memory manager to be stored in SDRAM as a 64-byte memory 

1 1 cell. The reassembly block strips the 16 bytes of pad, performs a MPLS operation of 

12 adding the MPLS packet header based on the contents of the FlowID lookup table before 

13 sending the resultant packet out to the Framer (via the outgoing SPI block). 

14 Figure 16 illustrates the data format for the Egress Application Type 10 (MPLS 

15 packets =>ATM). In this application, the incoming data (from the switch) via the 

16 incoming SPI interface is a switch cell which comprises 8/16 bytes of switch header, 48 

17 bytes of AAL5 data, and 16 bytes of pad. The switch header is removed by the lookup 

18 block, resulting in a 64-byte cell. The 64-byte cell is then passed through the 

19 segmentation block to the memory manager to be stored in SDRAM as a 64-byte memory 

20 cell. The reassembly block adds the 4-byte ATM header and appends 4-byte of pad, as 

21 well as strip the 16-byte of pad. These 4 newly added pad bytes are subsequently 

22 removed to allow the 52-byte ATM cell to be sent to the Framer (via the outgoing SPI 

23 block). 

24 Figure 17 illustrates the data format for the Egress Application Type 1 1 (packet 

25 => packet). In this application, the incoming data (from the switch) via the incoming 

26 SPI interface is a switch cell comprising 64 bytes of AAL5-like data and an 8/ 16-byte 

27 switch header. The switch header is removed by the lookup block, resulting in a 64-byte 

28 cell. The 64-byte AAL5-like cell is then passed through the segmentation block to the 

29 memory manager to be stored in SDRAM as a 64-byte memory cell. The reassembly 

30 block maps the 64-byte AAL5-like cells into a packet, and adds a 8/16-byte packet 

3 1 header. The reassembly block also checks the CRC in the AAL5 trailer before removing 

32 the AAL5-like encapsulation. The resultant packet is subsequently passed on to the 

33 Framer (via the outgoing SPI block). 

34 Figure 18 illustrates the data format for the Egress Application Type 12 (ATM de- 

35 encapsulation). In this application, the incoming data via the incoming SPI interface is a 

36 switch packet, which comprises 8/16 bytes of switch header, 4 bytes of ATM header, 48 

37 bytes of ATM data, and 4 bytes of pad. The switch header is removed by the lookup 

38 block, resulting in 56-byte cell. The 56-byte cell is passed to the segmentation block, 

39 which adds 8 bytes of pad to allow the memory manager to pass the 64-byte cell to 

40 SDRAM for storage. The reassembly block translates the ATM header, strips out 8 bytes 

41 of pad (leaving 4 bytes to allow passing through the 8-byte/64-bit data path from the 

42 reassembly block to the outgoing SPI block). These last four bytes of pad will be 

43 removed, allowing the outgoing SPI to send the 52-byte ATM cell to the Framer. 

44 Figure 19 illustrates the data format for the Egress Application Type 13 

45 (Segmentation). In this application, the incoming data via the incoming SPI interface is a 

46 packet burst, including the 8/16 bytes of switch header. The packet data is passed onto 
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1 the segmentation block via the lookup block. At the segmentation block, the packet data 

2 is segmented into 48-byte AAL5 cells, and 16 bytes of pad are added to allow the 

3 memory manager to pass the 64-byte cells to SDRAM for storage. The reassembly block 

4 removes the 16 bytes of pad, adds the 4-byte ATM header to the 48-byte ATM data, and 

5 pads with 4 additional bytes to allow passing through the 8-byte/64-bit data path from the 

6 reassembly block to the outgoing SPI block. These last four bytes of pad will be 

7 removed subsequently, allowing the outgoing SPI to send the 52-byte AAL5 cell to the 

8 Framer. 

9 Figure 20 illustrates the data format for the Egress Application Type 14 (Egress 

10 Packet Bypass). In this application, the incoming data via the incoming SPI interface is a 

1 1 packet burst, which comprises 8/16 bytes of switch header and 64 bytes of AAL5 data. 

12 The packet data is passed onto the segmentation block via the lookup block. At the 

13 segmentation block, the packet data is segmented into 64-byte AAL5-like cells for 

14 storage in SDRAM memory. The reassembly block maps the 64-byte AAL5-like cells 

15 into packet, and adds a packet header of 0-16 bytes. It also checks the CRC in the AAL5 

16 trailer before removing the AAL5 encapsulation. The packet is then passed to the Framer 

17 via the outgoing SPI block. 
18 

19 Block Diagram 

20 Figure 21 is a high-level block diagram of the Maximus 1 integrated circuit chip, 

21 the Maximus 2 integrated circuit chip, and various memory devices coupled to the 

22 Maximus 1 and 2 integrated circuit chips. All of these integrated circuits are present, in 

23 one embodiment, on a line card of a router. The functionality of the Maximus 1 (MX-1) 

24 and Maximus 2 (MX-2) integrated circuit chips may be disposed on two different 

25 integrated circuit chips as illustrated in Figure 21 . Alternatively, the circuitry of the 

26 Maximus 1 and Maximus 2 integrated circuit chips is combined such that all that circuitry 

27 is disposed on a single integrated circuit chip. 
28 

29 Features 

30 General Features 



31 • Supports full duplex lOGbps performance 

32 □ Protocols and internetworking 

33 o ATM 

34 ■ Operation and Management (OAM) cells 

35 ■ Signaling support 

36 ■ VPA^C translation 

37 ■ ATM to packet internetworking with Segmentation and reassembly 

38 " VC merging support 

39 o Packets 

40 ■ MPLS 

41 • MPLS tag switching 

42 • IP to ATM internetworking with AAL5 segmentation and 

43 re-assembly. 

44 • MPLS header generation. Used for MPLS tunneling and 

45 aggregation 
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1 • MPLS merging several flows from different ports merged 

2 to single output. 

3 ■ Any packet with MPLS-like header 

4 o Management support 

5 ■ SNMP, MIB counters 

6 ■ Alarms and Events 

7 o Multicast support for both ATM and Packets 

8 • Applications supported: 

9 o Ingress Traffic Manager 

10 ■ ATM Cells in, ATM Cells out. 

1 1 ■ ATM (AAL5) Cells in, packets out. 

12 ■ MPLS Packets in, ATM (AAL5) Cells out. 

13 > MPLS Packets in, MPLS Packets out. 

14 o Egress Traffic Manager 

1 5 ■ ATM Cells in, ATM Cells out. 

15 ■ ATM (AAL5) Cells in. Packets out. 

17 ■ MPLS Packets in, ATM (AAL5) Cells out. 

1 8 ■ MPLS Packets in, MPLS Packets out. 

19 o Reassembler 

20 ■ ATM (AAL5) cells in, packets out 

21 ■ ATM cells in, ATM cell encapsulated inside packet out. 

22 ■ MPLS Packets in, MPLS Packets out. 

23 o Segmentation 

24 ■ MPLS Packets in, ATM (AAL5) Cells out. 

25 ■ ATM cell encapsulated inside packet in, de-encapsulated ATM cell 

26 out. 

27 ■ MPLS Packets in, MPLS Packets out. 

28 • Interfaces supported 

29 o Generic CPU Interface 

30 o SPI-4 phase-2 interfaces for both packets and cells. 

3 1 ■ 16-bit, 400 MHz, DDR interface. 

32 ■ Supports up to 64 ports (any combination of STS-1 - STS-192). 

33 ■ SPI-4 used for both incoming and outgoing interfaces. 

34 • Traffic management and QoS mechanisms 

35 o Traffic Management on Ingress 

36 ■ Priority scheduling with weighted queuing 

37 • Per Flow per egress port per class scheduling (up to Ik 

38 queues) 

39 • Flow control with the switch 

40 o Traffic Management on Egress 

41 ■ Traffic shaping through dual leaky bucket 

42 ■ Traffic scheduling 

43 • 64 Utopia level 4 ports 

44 • 8 Class of service 

45 • GCRA scheduling 



46 
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1 ■ Weighted fair queuing 

2 o VP tunneling support 

3 o CBR tunneling 

4 o MPLS tunneling 

5 CPU Interface 

6 • Supports CPU interface for management and control information. 

7 • Supports 32-bit CPU bus interface. 

8 • Support Mux/Non-Mux and Big Endian/Little Endian modes 

9 • Supports interrupt generation. 

10 o Hierarchical interrupt structure 

11 Incoming SPI-4 Interface 

12 • Supports the SPI-4 phase n specifications for 10 Gbps rate. 

13 • Unpacks incoming 16-bit DDR data at 400 MHz into 64-bit wide word at 200 

14 MHz. 

1 5 • Performs diagonal interleaved parity checking. 

16 • Extracts in-band port address, SOP, EOP and error-control code. 

17 • In-band start-of-FIFO status signal. 

18 # 200 MHz for the FIFO status transfer. 

19 • Supports maximum 64 ports. 

20 • Generates 2-bit FIFO status back to the transmit device using programmed port 

21 calendar. 

22 o Programmable port calendar with STS-1 granularity. 

23 Lookup Engine 

24 • Supports up to IM flows. 

25 • Supports up to 64 input ports. 

26 • Tables and configuration registers loaded via the processor interface. 

27 • Lookup Algorithm 

28 o Label generator selects any 6 bytes from the first 32 bytes of the packet. 

29 o Label generator is programmable on a per port basis. 

30 o The label generator has one byte of granularity. 

31 o Hash generator creates 2 hash values or slots in parallel from a key using 2 

32 separate CRC polynomials as hash functions. 

33 o Four bins per bucket to minimize collisions. 

34 o Hash table lookup constrained to resolve within 4 tries. 

35 o Overflow CAM of 256 or 64 entries, the largest available, for lookups that 

36 fail to resolve in 4 tries. 

37 o Fully pipelined, one lookup every 8 cycles. 

38 • Supports the following flows with the following headers: 

39 o MPLS with PPP encapsulation. 

40 o MPLS with LLC/SNAP encapsulation. 

41 o MPLS with Frame Relay encapsulation.. 

42 o Custom switch headers for traffic manager application. 

43 o Supports bypass lookup using MPLS-like header. 

44 Segmentation Engine 

45 • Segments incoming packet data into cells on a per port basis 
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1 o Supports a maximum of 64 external ports 

2 o Supports an additional port for data from processor 

3 o Segments incoming packet data into ATM cells 

4 o Appends AAL-5 trailer for ATM segmentation 

5 o Segments incoming packet data into Switch cells of 64 bytes 

6 o Appends AAL-5 like trailer for non-ATM segmentation 

7 • Supports programmable maximum packet length of 16K bytes. 

8 • Gathers per port statistics 

9 o 32-bit counter for incoming packets. 

10 o 48-bit counter for byte count. 

11 o 16-bit counter to count discarded packets. 

12 • Tags bad packets from SPI, to be discarded by per flow queue. 

13 • Internal SRAM to store per port segmentation context and data 

14 Memory Manager 

15 • Uses two channels of 128-bit (64-bit DDR) SDRAM 

16 • Manages read and write requests to all eight banks separately to maximize 

17 utilization. 

18 • Uses 128kb of internal SRAM for temporary storage of data to and from 

19 SDRAM. 

20 • Sopisticated load-sharing algorithm to distribute traffic over various banks. 

21 Reassembly and Header Processing Engine 

22 • Types of operation: 

23 o ATM cells to ATM cells no action required with translation 

24 o ATM cells to ATM cells with translation 

25 o ATM cells to Switch cells 

26 o Switch cells to Switch cells 

27 o Switch cells to Packet Data 

28 o ATM cells to Packet Data 

29 o AAL-5 cells to ATM cells, appending ATM header 

30 o AAL-5 cells to Packet Data 

3 1 Header Processing Features 

32 • Supports a maximum of IM total flows. 

33 • Types of lookup supported 

34 o No lookup, pass-through traffic 

35 o ATM processing 

36 ■ ATM Header append 

37 ■ VPI, VCI header translation 

38 ■ Encapsulate raw cells and 0AM cells into packets for routing 

39 through packet fabrics. 

40 ■ Encapsulate ATM cells into packets for routing through packet 

41 fabric. 

42 o MPLS processing 

43 ■ MPLS push/pop 

44 ■ MPLS header translation 
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1 Reassembly Features 

2 • Reassembles incoming cells into packets on a per port basis 

3 o Supports a maximum of 64 external ports 

4 o Supports an additional port for data to processor 

5 o Reassembles incoming ATM cells into packet data. 

6 o Strips AAL-5 trailer for ATM reassembly 

7 o Segments incoming Switch cells of 64 bytes into packet data. 

8 o Strips AAL-5 like trailer for non-ATM reassembly. 

9 • Supports programmable maximum packet length of 16K bytes. 

10 • Gathers per port statistics 

1 1 o 32-bit counter for outgoing packets. 

12 o 48-bit counter for byte count. 

13 o 16-bit counter to count discarded packets. 

14 • Tags bad packets to SPI, to be discarded by downstream device. 

15 • Internal SRAM to store per port segmentation context and data 

16 Outgoing SPI-4 Interface 

17 • Supports SPI-4 phase n spec at 10 Gbps rate. 

18 • Packs the 64-bit data at 200 MHz to 16 bit DDR data at 400 MHz. 

19 • Multiplexes the data and the control onto the SPI-4 bus. 

20 • Diagonal Interleaved parity generation. 

21 • Programmable data transfer burst sizes of multiple 16 bytes. 

22 • Passes the FIFO status information to the scheduler to stop transferring. 

23 • Schedules training sequence for data path de-skew. 

24 • Supports maximum 64 ports 

25 • Receives 2-bit FIFO status from the receive device using programmed port 

26 calendar. 

27 o Programmable port calendar with STS-1 granularity. 

28 Per Flow Queue Engine 

29 • See below, page 88. 

30 Traffic Shaper and Traffic Meter 

31 •See below, page 129. 

32 Output Scheduler 

33 • See below, page 118. 
34 

35 Signals 

36 See Figure 22. 
37 

38 MX 1 CPU Interface 

39 This block is responsible for three different functions: 

40 • Connects between the blocks and the CPU via the CPU interface. It decodes the 

41 address to Figure which block to access and generate the chip select to each 

42 block. 

43 • Generates global synchronization signals to the rest of the blocks. 

44 • Holds general control registers like version register, mode register etc. 
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1 

2 Features 

3 1 . Address bus of 10 bits per chip. 

4 Bits 0-5 are used as an address inside the block (up to 64 registers per block) 

5 Bits 9-6 are used to select a block (up to 16 blocks per chip) 

6 2. Data bus of 32 bits per chip. 

7 Externally this bus is b-directional 

8 Internally this bus is split to 2 busses: DATA_IN and DATA_OUT. 

9 3. Chip select signal is the qualifier for data, address and read/write busses. 

10 4. The CPU interface can support multiplexed CPU interface or non-multiplexed CPU 

1 1 interface. When using multiplexed interface, the address and data are folded to the 

12 DATA[31:0]bus. 

13 5. CPU accesses all registers in all blocks via direct read/write commands. 

14 6. CPU accesses all internal/external memories in all blocks via indirect accesses using 

15 the general-purpose registers dedicated for these accesses. 

16 7. Blocks addresses can be found in Figure 23. 

17 8. Generates a global sync signal once every 520 cycles of the 200MHz clock (65 ports, 

18 8 cycles per port). 
19 

20 Functional Description 

21 The CPU IF block is used to interface with the CPU. The CPU IF block decodes 

22 the incoming address bus to generate a unique chip-select signal to each block. The CPU 

23 IF block also holds general registers that contain general parameters and modes for the 

24 entire chip. Basically any logic that is not related directly to a specific block is in this 

25 block. 

26 The CPU IF block can operate in two modes depending on an input pins to select 

27 between them: 

28 • Regular CPU access: separate busses for data and for address. In this case address 

29 bus is qualified be the chip-select signal. 

30 • Multiplexed CPU access: one bus is multiplexed for address and for data. In this 

3 1 case address is qualified by the ALE signal. 

32 The CPU interface is based on Intel's microprocessor interface (when multiplexing 

33 address/data bus). The following timing diagrams defines the characteristics of the CPU 

34 interface for Read and for Write accesses: 
35 

36 Read access characteristics 

37 See Figures 24 and 25. 

38 • RDWR_L signal has the same timings as the ADDR bus. 

39 • A valid read cycle is defined when RDWR_L is latched as high on the negative edge 

40 of the CS_L signal. 

41 •In non-multiplexed address/data bus architecture, ALE should be held high so 

42 parameters tsair, tHair, tvL, tsir and tHir are not applicable. 

43 • In our implementation the address is latched on the negative edge of the CS_L 

44 therefore tHar is not applicable. 
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1 

2 Write access characteristics 

3 See Figures 26 and 27. 

4 • RDWR_L signal has the same timings as the ADDR bus. 

5 • A vahd write cycle is defined when RDWR_L is latched as low on the negative edge 

6 of the CS_L signal. 

7 • In non-multiplexed address/data bus architecture, ALE should be held high so 

8 parameters tsaiw, tHaiw, tvL, tsiw and tniw are not applicable. 

9 • In our implementation the address is latched on the negative edge of the CS_L 
10 therefore twaw is not applicable. 

11 

1 2 Timing diagram of signals to blocks 

13 The diagram in Figure 28 describes the timings of the CPU interface to each 

14 block. Note that chip-select signal is now asserted for only one cycle of the 200MHz 

15 clock. For a write or read cycle, all parameters are valid and stable before chip-select is 

16 asserted. When a read cycle occurs data is driven from the block after a maximum of 3 

17 clock cycles of 200MHz clock. 
18 

19 GSYNC - Synchronization signal 

20 GSYNC signal is a one-cycle pulse in the 200MHz clock, which is asserted once 

21 every 520 cycles. The SYNC signal is used for synchronization between the blocks and 

22 between the chips as well as to generate the slot-count and the CPU-port indication. See 

23 Figure 29. 



24 Slot count and CPU port indication 

25 The signals slot_count and CPU_port are generated inside each block base on the 

26 GSYNC signal. Each block first needs to sample GSYNC and use the sampled version to 

27 generate the signals. See Figure 30. 
28 

29 Interface 

30 The CPU IF block generates chip-select signal per block, the rdwrj, addr and 

3 1 data busses are propagated to the blocks directly. The blocks perform the read or write 

32 command based on the rdwr J signal only when the chip-select signal per the block is 

33 asserted. The chip-select signal per each block is synchronous to the 200MHz clock 

34 domain already. Each block first samples the incoming signals and then uses them. This 

35 is being done to solve any timing issues that may occur. If during placement and routing 

36 the signals from the CPU IF to the blocks will violate the 5nSec budget, another set of 

37 FFs can be placed in the middle of the path. 

38 For a read cycle the CPU IF block selects the data busses coming in from the blocks 

39 based on the chip-select per block (the address). See Figure 31. 

40 The address, data and rd/wr signals takes about 4 cycles of 200MHz to propagate 

41 from the chip's pins to the block (6 cycles are used for delay calculation). 2 cycles are 

42 needed to perform read or write accesses in the block. 

43 Therefore a total of 8 cycles are needed to perform a read or a write access. 
44 



26 



AZA-OOl/2001-POO 




PATENT 



1 Block Descriptions 

2 

3 CPU Interface 

4 The negative edge of the CS_L signal qualifies the ADDR[9:0] and the RDWR_L 

5 signals. The positive edge of the CS_L signal qualifies the DATA bus for read and for 

6 write accesses. The CS_L is basically an enable signal to latches. The circuit shown in 

7 Figure 32 shows the basic latching circuit and the synchronization to 200MHz clock 

8 domain: 
9 

10 Test Mux 

1 1 The Test Mux block is used to multiplex test mux outputs from all the blocks 

12 inside the chip. The output bus of the test mux go out to the pins that will be used for 

13 testing and debugging purposes. Refer to the Figure below for the Test Mux block 

14 diagram. The Tstmux_Blk_Sel register (refer to the Register section) is used to select 

1 5 which block signals to be output to the test pins (Tstmux_pin[3 1 :0]). Each block inside 

16 the chip also has selection register (e.g. Seg_Tstmux_Sel, Ras_Tstmux_Sel) to select a 

17 group of internal signals to be observed. 

18 The test mux outputs of all the blocks should be clocked out of flip-flops (refer to 

19 the diagram below). The CPU_clk that is running at 200 Mhz, is used to clock the output 

20 of the test mux inside the CPU Interface block. It is also going out to a test pin called 

21 Tstmux_clk. See Figure 33. 
22 

23 Soft Reset 

24 Each block inside MX- 1 chip can be reset individually by writing a ' 1 ' to its 

25 Soft_Reset register (refer to the Register section). Writing a '0' to the register will release 

26 the block from being reset and get it back to its normal operation. Each soft reset signal 

27 of each block is also coupled with the global power-on reset pin (POR_L). Refer to 

28 Figure 34 below. 
29 

30 Input/Output Disable 

3 1 Each block in the MX-1 chip has 2 inputs coming from the CPU Interface block 

32 that are used to disable the input and output of the block. The Input_Disable and 

33 Output_Disable registers (refer to the Register section) can be programmed to disable the 

34 input or/and output of a particular block. Refer to Figure 35 below. 
35 

36 Registers 

37 The following table describes the registers implemented in the CPU IF block. 

38 These registers act as control to the rest of the chip. The registers are implemented in the 

39 200MHz clock domain even if there is no need to do so since their value is fixed and 

40 stable during normal operation. See Figure 36. 
41 

42 Incoming SPI-4 

43 The Incoming System Packet Interface Level 4 (SPI-4) block receives 16-bit 

44 DDR ATM or Packet over SONET/SDH (POS) data from the Transmit Device (PHY or 

45 Switch based on ingress or egress application) and sends out-of-band FIFO status 

46 information to the Transmit Device. The incoming SPI-4 interface block outputs data 
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1 with control information to the lookup engine, and receives FIFO status from 

2 segmentation engine. Furthermore, the CPU can program the registers in the incoming 

3 SPI-4 interface block via CPU interface. 
4 

5 Features 

6 - Supports the SPI-4 phases n spec at 10 Gbps rate 

7 - Packs incoming 16-bit DDR data at 400MHz into 64-bit wide word at 

8 200MHz 

9 - Free buffer list to manage per port FIFO and absorb data while look up engine 

10 is doing the look up 

1 1 - Diagonal interleaved parity checking for data 

12 - Extracts in-band port address, SOP, EOP and error-control code and presents 

13 to the lookup engine 

14 - Passes 2-bit FIFO status back to the transmit device in a round robin fashion 

15 based on the port calendar 

16 - Programmable port calendar to output FIFO status 

17 - In-band start-of-FIFO status signal 

18 - 400 MHz for the FIFO status transfer 

19 - De-skews +/- 1 bit time after running the training patterns 

20 - Supports maximum 64 ports 
21 

22 Functional Description 

23 The incoming SPI-4 interface block receives data with in-band port address, 

24 delineation information and error control code from an external transmit device. It 

25 separates the data and control and passes them to the lookup engine. It also sends out-of- 

26 band flow control information to the transmit device when it receives the information 

27 from the segmentation block. See Figures 37, 38 and 39. 



28 

29 Block Description 

30 Bit Align 

31 The bit-align block adds programmable delays to both data and control lines to 

32 correct for relative skew differences of up to +/- 1 bit time. The programmable delays are 

33 controlled by the bit align control block which receives pre-defined training patterns to 

34 determine the adjustment. It outputs 32 bit SDR data with two control signals. The 

35 control signals indicate which 16-bit word is data or control word. See Figure 40. 
36 

37 Bit Align Control 

38 This block compares the received training patterns with the pre-defined patterns 

39 and determines the receive timing of the lines. Based on the timing, it generates the 

40 control signal the select different delays to de-skew the data and control lines. 
41 

42 Drop IDLE 

43 This block drops the extra idle control words from the incoming data and writes 

44 only data, training pattern, and the first detected idle control word into. the input FIFO. 

45 This block has a state machine which powers up in the idle state and remains in that state 
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1 until data or the first idle control word is detected. For the first detected idle control word, 

2 the state machine advances to the state named FIRST DDLE; and for non-idle control 

3 word it advances to the state named SCAN. If the state machine is in the FIRST IDLE 

4 state, it advances to the SCAN state if non-idle control word is detected, otherwise, it 

5 goes back to the IDLE state and asserts first Jdle signal. If the state machine is in the 

6 SCAN state, it stays in this state until idle control word is detected and it then advances to 

7 the FIRST IDLE state. The incoming data and control words are allowed to pass through 

8 and written together with the control signals into the input FIFO when the state machine 

9 is in the SCAN or FIRST IDLE states. See Figure 41 . 
10 

11 Input FIFO 



12 This input FIFO is a 256x34 dual-port SRAM running at 400MHz to absorb 

13 incoming SPI-4 data, control words, and control signals. The write operation is controlled 

14 by the drop IDLE state machine, and the read operation is controlled by the receive state 

15 machine (RX_SM) when the FIFO is not empty. 
16 

17 Rx State Machine 

18 The Rx state machine receives 32-bit words from the input FIFO with indications 

19 of data or control words of the two 16-bit words. It extracts the in-band control 

20 information, packs the data into 64-bit wide words, passes the training patterns to the bit 

21 align control block. It also sends the data and the control words to the parity-checking 

22 block to check the parity. 

23 A common control word format is used in both the incoming and outgoing 

24 interfaces. The Figure below describes the fields of the control word. When inserted in 

25 the data path, the control word is aligned such that its MSB is sent on the MSB of the 

26 incoming or outgoing data lines. A (payload) control word that separate two adjacent 

27 burst transfers may contain status information pertaining to the previous transfer, the 

28 following transfer, or both transfers. The table below shows a list of control word types. 

29 Maximus would ignore reserved control when detected from the incoming SPI-4 

30 interface. 

3 1 Figures 42 shows the SPI-4 control word format and Figure 43 and 44 contain the 

32 detailed description for the SPI-4 control word format. 

33 The Rx state transition table is shown in Figure 45 and is based on the types of 

34 control words received. Since the state machine has to decode two 16-bit words in one 

35 cycle, the table lists all the possible 16-bit words combinations. The 2-bit Ctrl signal is 

36 derived from the input DDR signal RCTL. Bit 1 is for the word 1 and bit 0 for word 0. 1 

37 indicates a control word and 0 is a data word. 

38 I_ctrl = Idle control word detected P_ctrl = Payload control word detected 

39 T_ctrl = Training control word detected 
40 

41 Parity Checking 

42 The parity-checking block calculates 4-bit Diagonal Interleaved Parity (DIP-4) 

43 and compares the calculated DIP-4 with the one received in the control word. Then it 

44 reports the error. The extent of the parity calculation is shown in Figure 46. A functional 

45 description of calculating the DIP-4 code is given in Figure 47. 
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1 Sync FIFO 

2 This FIFO is an 8x64 dual-port SRAM used to synchronize the 200MHz 

3 data/controls from the incoming SPI-4 to the internal clock. The write operation is 

4 controlled by the RX_SM and the read operation is controlled by the per_port control 

5 block. The per_port control reads from the sync FIFO as long as it is not empty. 
6 

7 SPI TBL 

8 The SPI TBL is a table of 64 entries, one for each port. The table contains control 

9 information for the incoming data burst for each port. 
10 

11 FBL 

1 2 The FBL (Free Buffer List) block is an 84x 10 single-port SRAM containing the 

13 pointers to the data FIFO that are free to accept data. 
14 

15 Data FIFO 

16 The data FIFO is a 672x64 dual-port SRAM consists of 84 blocks of 64 bytes. It 

17 is used to receive data for each incoming port and only send the data out if it has 

1 8 accumulated 64 bytes or reached end of packet. 



19 

20 Calendar 

21 The calendar block contains a port calendar conFigured at the startup time. An 

22 example of a port calendar is shown in Figure 48. 

23 The calendar block receives per-port-basis FIFO status from the segmentation and 

24 encodes the status of each port in a 2-bit data structure. Then it sends the status back to 

25 the transmit device in a round robin fashion based on the port calendar. The " U" pattern 

26 is reserved for in-band framing. A DIP-2 odd parity is sent at the end of each complete 

27 sequence, immediately before the "1 1" framing pattern. See Figure 49. A description of 

28 calculating DIP-2 is shown in Figure 50. 
29 

30 Memory Description 

31 

32 Summary of Internal Memories: 

33 See Figure 51. 

34 Interface 

35 Interface Ports 

36 See Figure 52. 
37 

38 Register Description 



39 See Figure 53. 

40 

41 Principles of Operation 

42 The incoming SPI block receives 16-bit DDR data, 1 -bit DDR control and a 400 

43 MHz clock from the LVDS pins of the Maximus chip. It first converts the 16-bit DDR 

44 data words into 32-bit SDR data words and a 1-bit DDR control into a 2-bit SDR control. 

45 The 2-bit control indicates which 16-bit word within a 32-bit word is the control word or 

46 data word. Based on the 2-bit control, the Rx state machine starts to extract the in-band 
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1 port address, start/end-of -packet indication and error-control code from the 32-bit data 

2 path and packs the 16-bit data words into 64-bit data words. 

3 Subsequently, the incoming SPI block outputs the 64-bit data words along with 

4 the control information at 200 MHz to the lookup engine. The incoming SPI block also 

5 calculates 4-bit Diagonal Interleaved Parity (piP-4) and reports an error if the calculated 

6 DIP-4 is not equal to the one received with the control word. 

7 The incoming SPI block also sends 2-bit FIFO status information back to the 

8 transmitting device. This FIFO status information is originated from the segmentation 

9 block indicating FIFO congestion conditions for all the ports. The SPI block sends the 

10 FIFO status to the transmitting device at the rate of one port per cycle in a round robin 

1 1 fashion based on the port calendar sequence. 
12 

13 Outgoing SPI-4 Interface 

14 The Outgoing System Packet Interface Level 4 (SPI-4) interface transmits 16-bit 



15 DDR ATM or Packet over SONET/SDH (POS) data to the receive device (PHY/Switch, 

16 depending on whether employed in the ingress or egress application) and receives out of 

17 band FIFO status information from the receive device. The outgoing SPI interface gets 

18 data and control information from the re-assembly engine, and passes the FIFO status 

19 information to the database. Furthermore, the outgoing SPI-4 interface also interfaces 

20 with the CPU through a software interface. The CPU can program the registers inside the 

2 1 blocks that comprise the Outgoing SPI-4 Interface. 
22 

23 Features 



24 - Supports SPI-4 phase n spec at 10 Gbps rate. 

25 - Unpacks the 64-bit data at 200 MHz to 16 bit DDR data at 400 MHz. 

26 - Muxes the data and the control onto the SPI-4 bus. 

27 - Diagonal Interleaved parity generation. 

28 - Programmable data transfer burst sizes of multiple 16 bytes with minimum of 

29 80 bytes. 

30 - Passes the FIFO status info to the database to stop transferring. 

3 1 - Schedules training sequence for data path de-skew. 

32 - Supports maximum 64 ports. 
33 

34 Functional Description 

35 The Outgoing SPI-4 Interface receives 64-bit data and control information from 



36 the re-assembly engine at 200 MHz. The Tx state machine generates 16-bit control 

37 words, unpacks the 64-bit data into 16-bit data words, and multiplexes the control and 

38 data words onto a 16-bit, 400 MHz DDR bus to an external receive device. The Outgoing 

39 SPI-4 Interface also passes the flow control information from the receive device to the 

40 database block within the Maximus chip. See Figure 54. 
41 

42 Block Description 

43 

44 Tx State Machine 

45 The Tx state machine stores and monitors the length field from the reassembly 

46 block. The length field is decremented on every data transfer from reassembly to SPI. 
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1 When the length reaches a threshold, the Tx state machine issues a request command to 

2 read the next port data and control information out of reassembly block. The threshold 

3 level is determined by a value that would allow back-to-back data transfer from 

4 reassembly block into SPI without gap on the data bus. 

5 The control word format is common for both incoming and outgoing interfaces. 

6 Refer to the Incoming SPI Interface document for details. 

7 The state machine also schedules to send the training patterns for the data path de-skew at 

8 least once every pre-defined DATA_MAX_T cycles. The training pattern is shown in 

9 Figure 55 from cycles 1 through 20a + 1. The a is the number of repetitions of the data 

10 training sequence. The DATA_MAX_T and a are conFigured at the startup. 

1 1 The state transition table is shown in Figure 56. At each state, two 16-bit words 

12 and pos_v and neg_v have to be output. And the pos_v and neg_v will be used to 

13 generate a DDR control signal. 1 indicates a control word and 0 a data word. If data_valid 

14 is set, there are data to be sent. Otherwise the state machine needs to generate one of the 

15 control words. 
16 

17 Data Parity Generation 

18 This block calculates 4-bit Diagonal Interleaved parity (DIP-4) and sends the 

19 DIP-4 along the control words. The extent of the parity calculation is shown in Figure 

20 57. A functional description of calculating the DIP-4 code is given in Figure 58. 
21 

22 Unpack Engine 

23 This block receives 64 bit data from Reassembly block and samples data at 

24 200MHz clock rate into the SPI out block. See Figure 59. 

25 The 64-bit data and a data valid signal are received following the control info 

26 read. The unpack engine unpacks the 64-bit input data into 16-bit words. In order to send 

27 16-bit DDR data at 400 MHz to the LVDS I/Os, the state machine has to come up with 

28 two 16-bit words every 400 MHz clock cycle. See Figure 60. 
29 

30 Configuration Register 

31 This block defines the organization of configuration space based on SPI-4 phase 2 

32 specifications. 
33 

34 Port Calendar 

35 This block stores the predefined calendar information to identify each port is 

36 ready to send data via the SPI interface. The port calendar contains three fields to indicate 

37 the PORT ID, JUMP and VALID. The port ID represents the port number for 

38 transmission of data and receiving of status. The port calendar can store up to 64 ports, in 

39 the event that less that 64 ports are used, JUMP indicates the end of calendar. VALID 

40 indicates if the port is valid or not. If a given port is not valid, the next PORT ID should 

41 be used, and the PORT ID marked as not valid should be skipped. 

42 When CPU updates the port calendar after initialization, CPU needs to meet the 

43 following conditions: 

44 - CPU can only modify one port at a time. 

45 . When CPU write a port, CPU needs to specify a port entry in the port calendar 

46 between port 0 to 63, and the specific port will be modified. 
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1 - When CPU read a port, CPU needs to specify a port entry in the port calendar 

2 between port 0 to 63, and the specific port will be read out. 

3 - CPU needs to stop the traffic for a given prior to modifying the port attribute 

4 (Jump, Valid). 

5 See Figure 61. 
6 

7 Port Calendar State Machine 

8 This block waits for rdy_4_nx_port_id from TX state machine to indicate the 

9 transmit state machine is ready to accept another port request. Upon receiving 

10 rdy_4_nx_port_id, calendar state machine issues the next port ID from the port calendar 

1 1 to the TX state machine. However, before issuing the next port ID, the status bit is check 

12 to ensure the SPI FIFO status is in the "Hungry" state for the given port. There is a 64-bit 

13 register to retain the value status on a per port basis. The block matches incoming status 

14 arriving into SPI against attributes from Port Calendar to identify the given port traffic 

15 status. 

16 After Port Calendar issues a valid port to the TX state machine, the Port Calendar 

17 State Machine pre-f etches the next Port ID from Port Calendar by performing a read. If 

1 8 the TX State Machine issues a read before a valid Port ID is inside Port Calendar State 

19 Machine, no_valid_port_id_available is returned to the TX State Machine. 



20 

21 Status Parity Check 



22 This block checks 2-bit Diagonal Interleaved parity (DIP-2) and verifies DIP-2 

23 parity bits for signal integrity. Assuming the TSCLK is running at 400 MHz, this block 

24 operates at the 400 MHz clock domain and samples the tstat [1 :0] twice every clocks. 

25 There are two 65-bit registers to hold the FIFO status. The first 65-bit registers 

26 hold the status value before parity check (pre_chk_st) and the second register holds the 

27 status after parity check (post_chk_st). Once the parity check passes, this block is 

28 responsible to copy the status from the pre_chk_st to the post_chk_st register. If there is a 

29 parity error, the error is reported into the control status register (spio_err [0]) by setting 

30 I'bl into the bit. This spio.err register is self-cleared upon CPU read. 

3 1 This block sends SPI status from post_chk_st to the Database block. The SPI port 



32 status will be sent in spio_dbs_stt [7:0], and status for port 0-7 should be sent when the 

33 spio_dbs_sync is asserted, and status for port 8-15 should be sent in the following clock, 

34 status 16-23 on the next clock, etc. SPI status spio_dbs_stt [8] should contain the status 

35 for the CPU port. spio_dbs_sync should be derived from the global_sync signal. 

36 When tstat [1:0] is 2'bOl, this indicates "Hungry" and transfer of up to 

37 ((MaxBurst2)*(16-byte)) blocks or the remainder of what was previously granted 

38 (whichever is greater) may be sent. Since Maximus internal cell size is 64-byte plus 

39 header (up to 16 bytes), after SPI-out provides an indication to Maximus to stop 

40 scheduling additional cell to memory, SPI-out needs to allow the remainder of current 

41 cell to go out of SPI. In other words, reassembly block does not support shipment of 

42 partial cell to SPI, each time SPI request for a cell, SPI must guarantee to be able to 

43 accept one complete cell. Therefore, the MaxBurst2 value should be programmed to 

44 minimum of 5. Once programmed to 5, when the "Hungry" indication is received, the 

45 remaining bytes in a given cell can still be shipped out. To notify Maximus to stop 

46 scheduling the next cell of data is the spio_dbs_stt bit from SPI-out to Database block. 
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1 Consequently, spio.dbs_stt should be set to one when "Hungry" is received for a given 

2 port. 

3 This interface between SPI-out and database drives the status of the 64 ports in 

4 the output SPI and the CPU port. It is a continuous interface. Every 8 cycles of 200MHz 

5 the status indication of the 65 ports is updated to a 65 bits register. The first cycle of the 

6 eight is marked with a specific signal (sync signal). 

7 Figure 62 shows the interface: 

8 A synchroniztion FIFO is used between the 400 MHz and 200 MHz clock 

9 domain. 

10 This block detects the synchronization pattern from tstat [1:0] by comparing the 



1 1 incoming tstat[l :0] against the framing pattern. Also, the port calendar state machine 

12 checks training sequence to see if there is any skewing on FIFO status and control lines. 

13 To check the training sequence for skew, this block needs to implement a counter for 

14 internal tstat, called sync [1:0], which cycles the nominal value of tstat [1:0] with respect 

15 to Maximus internal clock. The value of sync is compared against the value of tstate [1:0] 

16 (external) from the SPI pad and the difference between the two pairs of signals 

17 determines if there is any skew of control signals. See Figure 63. 
18 

19 TX FIFO 



20 This block implements a 4x 16-bit dual port SRAM to synchronize Maximus and 

21 SPI clock domains. On the incoming side, the write port operates at 200 MHz and pushes 

22 data into the TXFIFO when data_push is asserted by TX state machine. On the outgoing 

23 side, the read port operates at 400 MHz DDR and pops data into the tdat [15:0] when 

24 data_pop is asserted by TX state machine. 
25 

26 TX Status FIFO 

27 This block synchronizes status from 400 MHz to 200MHz domain using 4x 16-bit 

28 dual port SRAM. 
29 

30 Memory Description 

31 

32 Data Structure for Port Calendar 

33 See Figure 64. 

34 - Memory type: single-port SRAM 

35 - Memory configuration: 64 x 8 bit 

36 o Port ID: hold port identification value progranmied by the CPU. 

37 o Jump: indicates the end of the port calendar. When Jump is asserted, 

38 the given location is the last entry on the port calendar. Upon detecting 

39 the assertion of jump, the port calendar should recycle the port ID 

40 from the top of the port calendar. 

41 o Valid: When valid is asserted, it indicates the given port hold a valid 

42 connection. When valid is de-asserted, the port calendar v^ill skip the 

43 de-asserted port and go to the next port ID on the port calendar. 
44 

45 Data Structure for TX FIFO 

46 - Memory type: dual-port SRAM 
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1 - Memory configuration: 4 x 16 bit 

2 The TX FIFO is used to synchronize data between different clock frequencies. 
3 

4 Data Structure for Status FIFO 

5 - Memory type: dual-port SRAM 

6 - Memory configuration: 4 x 16 bit 

7 The Status FIFO is used to synchronize data between different clock frequencies. 
8 

9 Interface 

10 See Figure 65. 
11 

12 Timing Diagram 

13 Timing information between SPI-out and Reassembly, see Figures 66 and 67. 
14 

15 Register Description 

16 Start-up Parameters, see Figure 68. 
17 

18 Software Interface 

19 The software interface programs Outgoing SPI with a CPU. All Outgoing SPI's 

20 control registers and memories can also be written and read for test purposes through the 

21 software interface. The following sections list all the control registers and all the 

22 Outgoing SPI software interface commands. Please see a dedicated document for a 

23 detailed description of the software interface. 
24 

25 Registers 

26 See Figure 69. 
27 

28 Commands 

29 

30 Read Outgoing SPI Start-up Parameters. 

31 See Figure 70. 

32 Write Outgoing SPI Start-up Parameters 

33 See Figure 71. 

34 Read Outgoing SPI Start-up Parameters 

35 See Figure 72. 

36 Write Outgoing SPI Start-up Parameters 

37 See Figure 73. 
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1 Read Outgoing SPI Start-up Parameters 

2 See Figure 74. 

3 Write Outgoing SPI Start-up Parameters 

4 See Figure 75. 

5 Write PORT Parameters after Initialization 

6 See Figure 76. 

7 Read PORT Parameters after Initialization 

8 See Figure 77. 

9 Read PORT Full Parameters 

10 See Figure 78. 

1 1 Write PORT Full Parameters 

12 See Figure 79. 

13 Read Control Status Register 

14 See Figure 80. 

15 Write SPIO_TSTMUX_SEL 

16 See Figure 81. 

17 Read SPIO_TSTMUX_SEL 

18 See Figure 82. 

19 Reserved Op-codes 

20 All op-codes other than listed above are reserved. 
21 

22 Principle of Operation 

23 The outgoing SPI block receives FIFO status from the Incoming SPI. The 

24 beginning of the status is found based on the framing sequence. The beginning of the 

25 status from SPI interface is mapped to the port calendar to look up the status of each port. 

26 The status has to be synchronized to the intemal 200 MHz clock and passed to the 

27 database block, and intemal state machine to select a port that is available for data 
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1 transmission. Base on the status information, outgoing SPI block requests data from re- 

2 assembly block on a given port. Outgoing SPI unpacks the port data from re-assembly, 

3 delineation data between different ports and check for data error using control 

4 information from re-assembly. The outgoing SPI block calculates the 4-bit Diagonal 

5 Interleaved Parity (DIP-4) and combines those four bits with the control information to 

6 form a 16-bit control word. At every clock cycle of the 400 MHz clock, the outgoing SPI 

7 unpacks the 64-bit data into 16-bit data words and inserts control words by multiplexing 

8 onto 16-bit DDR bus at 400 MHz. If the re-assembly's control FIFO is empty, the 

9 outgoing SPI block sends idle control words onto SPI data bus. During regular interval, 
10 the outgoing SPI also sends training control word to provide skew detection. 

11 

12 Lookup Engine 



13 This block receives ATM/Packet data and control from the SPI-4 receive interface 

14 block. It looks up a Flow ID and a set of parameters from a hash table in external 

15 SSRAM and forwards the data flow with the parameters to the segmentation engine. 
16 

17 Features 

18 • Supports up to IM flows, 

19 • Supports up to 64 input ports. 

20 • Operates at 200 MHz/5ns cycle time. 

2 1 • Tables and configuration registers loaded via CPU Interface. 

22 • Label extraction is programmable on a per port basis. 

23 • Accepts packet SOP cells of 64 Bytes only. 

24 • Accepts ATM cells of 52 Bytes only. 

25 • Direct header stripping is programmable on a per port basis. 

26 • Hash generator creates 4 hash values or slots in parallel from a key using 4 separate 

27 CRC polynomials as hash functions, 

28 • Two bins per bucket or record to minimize collisions. 

29 • There are two record memories for up to 8 read operations per a tag lookup. 

30 • Hash table lookup constrained to resolve within 8 read operations. 

3 1 • Overflow CAM of 32 X 72 entries for lookups that fail to resolve in 8 read 

32 operations. 

33 • Fully pipelined lookup operations. 

34 • Supports a maximum of two 8 X 72 M SSRAM. The normal size is two 2 X 72 M 

35 SSRAM. 

36 • Supports the following flows with the following headers: 

37 o MPLS with PPP encapsulation. 

38 o LC-ATM that is VPWCI. 

39 o Switch headers. 

40 o FIDS. 

41 • Operates in ingress and egress modes. 
42 

43 Functional Description 

44 See Figure 83. 
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1 The lookup engine does label extraction, label hashing and table lookup for each 

2 data transfer that is SOP with a header. The type of traffic is unique to the incoming port. 

3 There will be no more than one type traffic on a port. The default type will get a special 

4 FID. The default FID is stored in a memory and it is specific to the input port. The lookup 

5 engine can strip the switch header of incoming cells. 
6 

7 Block Descriptions 

8 See Figure 84. 
9 

10 CPU Hash Generation 

1 1 The CPU through the interface can request a hash generation and get a hash index 

12 as a response. It will be used to set or tear down connections. 
13 

14 Port Configuration 

15 The memory has two uses. It specifies the type of traffic on that port. Also, it has 



16 the Traffic parameters. The CPU either writes the parameters or the logic after lookup. If 

17 the CPU writes the parameters, then they are not changed until the traffic type is. They 

18 will be used in default traffic. If the traffic is direct, then the rest of the parameters in the 

19 memory for that port are not used. If the traffic is ENCAP type, then another set of 

20 registers are used. When the port traffic is none of the above, then the parameters values 

21 such as the type and FID are looked up from the record memory for SOP traffic. Then at 

22 the last stage they are driven on the interface bus. At the same time they are written 

23 immediately in the port configuration memory. For none SOP cells, the parameters are 

24 read from the port memory at the last stage and then driven on the bus. This will avoid 

25 data coherency problem in case the cells for the same port are contiguous. For ATM cells 

26 the SOP and HOP cell is the same. 
27 

28 DATA Coherency 



29 The parameters on none SOP cells are read near the last stage of the data in order 

30 to avoid driving stale values on the interface bus. The CPU accesses to modify the 

3 1 memory are not tracked. The parameters used are the ones that are lookup or stored in the 

32 port memory. 

33 If the SOP cell has a match on the last lookup, the parameters are written in the 

34 port memory near the end of the lookup cycle. If the next data cell is for the same port, 

35 then the parameters are not written yet in the corresponding memory location. Therefore, 

36 if the cell is not SOP, the port memory is read near the end of the pipeline of the cell 

37 lookup. See Figure 85. 
38 

39 CPU Accesses 

40 The CPU accesses are allowed between lookup sequences when the memory bus 

41 is not busy. 
42 

43 External SSRAM 

44 The Flow ID tables are implemented in off chip rams using 200MHZ, ZBT 

45 SSRAMS. These rams provide single cycle access with no 'dead cycle' penalty when 

46 mixing reads and writes. The table will contain a maximum of 8M entries. There is a 
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1 programmable register that will determine the size of the memory. The hash result will be 

2 trimmed to a number of bits that is programmable. There are two SSRAM channels 

3 provided. The usual size is 2M entries per channel. The memory can be accessed directly 

4 without hash for a read and write operations. 
5 

6 SRAM INITIALIZATION 



7 The internal as well as the external ram will be initialized to 

8 

9 HEADER Selection 

10 The header depending on the type has different sources. It can be transmitted with 

1 1 the traffic as in the case of direct or the special type. It can be found in the port memory 

12 as in the case of the default. Last, it can be found in record memory. See Figure 86. 
13 

14 Hash FIFO and Record Reading 

15 See Figure 87. The data is stored until the port table is accessed. The label is 

16 generated. The data will continue to be clocked in a FIFO until the key is hashed and the 

17 index is calculated. 
18 

19 Memories 

20 Input Port Parameters Memory 

21 Each port will have a parameter entry in that table. The "type" parameter will 

22 determine if the rest of the entries are used. If the type is "000", then the rest of 

23 parameters of that memory entry are not used except for the header strip and tag 

24 extraction. The "HD STRP" indicates the number of bytes to strip from the header. The 

25 "INT HDR EXTRACT" indicates the number of bytes that represent the header. The 



26 header is at the end of the header stripped that is stripped. If the type is default, then the 

27 header to strip is zero. If the header to strip is not equal to zero, then the header will be 

28 stripped as in the case of the direct. The same argument is for the tag extract. The header 

29 will contain the FID, CLP, EFCI, and the 0AM bits. The bits will be extracted and used 

30 in lieu of the look up data. The number of bytes of the header that are stripped and the 

31 parameter location is specified in the table. If the type is "001", then it is a default type 

32 and the rest of the parameters of that memory entry are used. If the type is "001", then the 

33 incoming traffic will be treated differently. The traffic is special and it is treated later in 

34 the specifications in detail. Again there are no hashing or lookup operations. If the type is 

35 different than the above three, then the tag is extracted, hashed and the parameters are 

36 looked up from memory. There are 64 entries in the table because the device can support 

37 a maximum of 64 ports. When there is a lookup operation, the parameters that are looked 

38 up are stored in the port memory and are used on the traffic until the next SOP cell. 

39 Infields where the hardware and the software perform the write operations, the software 

40 does it for the default traffic only, the other types of traffic are updated it by the 

41 hardware. See Figure 88. 

42 The CPU writes the data in the port table entries for the default or the direct type. 

43 The logic will write the parameters for the port if it the type is not any of the above. The 

44 SOP cell will trigger the lookup. The rest of the cells on that port will use the table until 

45 the EOP or the next SOP. Then the parameters will change again. 
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1 In 12 encapsulation, if the CRC bit is set, then assert the CRC signal to the segmentation 

2 engine. See Figures 89 and 90. 
3 

4 Table Lookup Memory 

5 The index or hashed key provided by the hash generator is used as an address. 

6 The number of bits that are used from the hashed key or index is programmable and 

7 depends on the size of the SRAM. The address is a base one and it is used to access four 

8 entries in the hash table located in external SSRAM. When these locations are read, the 

9 hash key is compared to the content of the memory. If the first location does not match, 

10 then there is collision and the second one is compared the same way. The process is 

1 1 repeated for all four entries. If there is no match and collisions resulted from the compare 

12 operations, then the process is repeated for the second hash result or index and so forth. If 

13 at the end there are only collisions, then the CAM is accessed and the process of 

14 comparison is repeated. If there is a hit, then in memory location there is Flow ID (FID), 

15 the QOS, and the rest of the control bits. Each hash table entry requires 72 bits. The bit 

16 definitions are summarized in Figure 91. The output port number is used to access a 

17 register. If the register bit corresponding to that port is set, then the EOF cell will have 

1 8 the error or discard flag set. 
19 

20 Hash Data Memory 

21 See Figure 92. The data FIFO is a 15 X 104 bit wide FIFO. The FIFO 

22 compensates for the pipeline delay of the lookup path. It also provides buffering that is 

23 needed when the segmentation engine is not ready to accept any data. The data receiver 

24 block writes the FIFO. The data transmitter block reads the FIFO. The segmentation 

25 engine will assert a ready signal when it can accept data. When the ready is asserted, then 

26 the data is pushed to the segmentation engine. 
27 

28 Interface 

29 See Figures 93 and 94. When the segmentation engine asserts the 

30 "ALMOST_FlJLL_SIGNL", the lookup engine will finish transferring the cell of eight 

3 1 transfers. Then at the cell boundary, the lookup engine will terminate transmission. At the 

32 same time the lookup engine will signal the SPI interface to terminate pushing data to the 

33 look up engine. The pipe of tag lookup operations is not violated. There are no lookup 

34 operations when the traffic flow is suspended. 
35 

36 CPU Interface 

37 There are signals that are generated by the CPU interface and are used to control 

38 the enable and disable signals. See Figures 95 and 96. 
39 

40 Software Interface 

41 See Figures 97 and 98. 

42 The default value for the memory size is "001" which corresponds to 2 MEG. 
43 

44 FID Memories Select 

45 See Figures 99 and 100. The default value for chip select is "00". 
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1 

2 Register Description 

3 The CPU interface doesn't support burst capability, instead, any read or write 

4 command will terminate with one address location; data bus is still 32 bits. Lx)okup 

5 Engine block has a fixed size of addresses allocated to it and all accesses to any 

6 internal/external memories or register is done via that specific address space. 

7 This method require the CPU Interface manager to have a mapping of the address spaces 

8 per block so that an incoming command will be redirect to the correct block. 
9 

10 Features 

1 1 • Data bus of 32 bits. 

12 • Single address access (not a burst access) 

13 •All external memories and most of the internal memories are accessed via indirect 



14 accesses through registers 

15 • Each block has dedicated addresses for its internal registers of up to 64 registers. 

16 A command drives address and command (Rd or Wr) to the address FIFO and data if it's 

17 a write command. Each block starts it's own CPU access by POPING the 

18 address/command from the FIFO. It means that any CPU transaction starts when the 

19 address/command FIFO is not empty. 
20 

21 Address space 

22 The address space is 64 entries only. The internal registers directly mapped to 



23 these addresses and external/internal memories are indirectly mapped. 

24 The address spaces per lookup engine are shown in Figure 101. 
25 

26 Direct/Indirect Accesses 



27 A direct access is used to access. If there are internal memories or registers, which 

28 are longer then 32 bits, then read or write to these locations will be executed in more then 

29 one access. For example an entry in internal memory of 50 bits width (or a register with 

30 the same length) will be accessed in two CPU commands first access bits [31:0] and then 

31 [49:32]. Each register or small internal memories have their own address, which is unique 

32 to the entire chip set. 

33 An indirect access is used to access internal and external memories. Each block 



34 has a set of registers dedicated for the indirect access. One register contains the address 

35 and the command (Rd, Wr, Init etc). Few other registers contain the data for a write 

36 command. Writing to the COM/ ADD register triggers the execution of the conmiand, 

37 once the block completed the execution of the command, the COM portion of the 

38 command is cleared by the block. When a read command occurs the data is written to the 

39 registers and then the CPU reads the registers to get the data. 
40 

41 Registers 

42 See Figure 102. 
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1 

2 Commands 

3 NOP 

4 See Figure 103. 

5 Read LUT Memoryl (72 bits) 

6 See Figure 104. 

7 Write LUT Memoryl (72 bits) 

8 See Figure 105. 

9 Read LUT Memory2 (72 bits) 

10 See Figure 106. 

1 1 Write LUT Memory2 (72 bits) 

12 See Figure 107. 

13 Read Port Memory (44 bits) 

14 See Figure 108. 

15 Write Port Memory (44 bits) 

16 See Figure 109. 

17 Setup connection command for KEY 

18 See Figure 1 10. This command sets up a connection by accessing external 

19 memories and initializes the corresponding locations to the proper parameters. It is a 

20 write command. 

21 Tear-Down connection command for KEY 

22 See Figure 111. This command tears down a connection by accessing external 

23 memories. It is a write command 
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1 GET HASH FID 



2 See Figure 1 12. The response will be in the register. See Figure 1 13. This 

3 command tears down a connection by accessing external memories. It is a write 

4 command. 

5 Read CAM Memoryl (72 bits) 

6 See Figure 1 14. 

7 Write CAM Memoryl (72 bits) 

8 See Figure 115. 

9 Read CAM Memory2 (72 bits) 

10 See Figure 1 16. 

1 1 Write CAM Memory2 (72 bits) 

12 See Figure 117. 

13 INIT ALL MEMORIES 

14 See Figure 118. 

15 INIT MEMORY 1 & CAM 1 

16 See Figure 119. 

17 INIT MEMORY 2 & CAM 2 

18 See Figure 120. 
19 

20 General commands 

21 The following commands are combinations of the above simple commands. A 

22 user command like this should be broken to several Read or Write commands in the 

23 driver. 

24 

25 Initialized External memories 

26 • Write "0" in all Record Memory 1. 

27 • Write "0" in all Record Memory 2. 
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1 

2 Initialized Internal memories 

3 • Write "0" in all Port memory. 
4 

5 Principle of Operation 

6 The lookup engine has to perform a port lookup before the FID lookup operation is 

7 performed. The memory is initialized and the valid bit is set for that FID. IF the valid bit 

8 is not set, then there will be no match. The lookup engine has the port of incoming traffic. 

9 As a result the following is done: 

10 1 . Read the port parameter entry by accessing the port parameter table. 

11 2. Get the LOOKUP TYPE. 

12 3. Extract the tag. 

13 4. Perform the stripping, hashing, or lookup operations according to the type. 

14 Once the parameter entry is read, then the lookup type is known. If the lookup type is 

15 direct, then the lookup engine will extract the FID, QOS, CLP, EFCI, and 0AM if any 

16 and push all the control and data to the segmentation engine. If the traffic is default, then 

17 the parameters from the port table are used. If the traffic is L2, then the operation is 

18 handled as described in another section. If the traffic type is none of the previous ones, 

19 then the tag is extracted according to the type of traffic and then it is hashed. The result of 

20 hashing will be used to access the memory to get the rest of the parameters if there is a 

21 match. The memory content of the tag is compared to the hash tag. If they are equivalent, 

22 then the rest of the memory content is used for that FID, QOS, CLP, EFCI and 0AM. If 

23 the tags do not compare then there is collision and the next hash result is used. There are 

24 four hash generators that will generate 4 indices. Two indices will be used for one 

25 memory and the other two for the second memory. Each record holds two bins. The total 

26 is 8 bins that might collide. If there is no match but collisions, then the overflow CAM is 

27 accessed. If there is collision or no record, then the packet or the cell is marked for CPU 

28 and a register is used for the FID. 
29 

30 CPU FIDS 

31 If the keys stored in the memory locations do not match the hashed key, then the 

32 CAM content will be compared and the if it fails to compare, then the traffic will be sent 

33 to the CPU. There is a register that holds the FID and the rest of the bits necessary for the 

34 data traffic. 
35 

36 TAG Extraction 

37 The lookup engine requires the minimum burst size of 16 bytes for the key 

38 extraction. The hardware extracts a contiguous number of bytes starting at an offset 

39 according to the traffic type of that port. The type of label to be extracted is determined 

40 by the port number. The location of the label is dependent of the type of traffic and is 

41 hard coded in the hardware. 
42 
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1 ATM TYPES 

2 If the type of the traffic is ATM, then the CLP, EFCI, and 0AM bits are extracted 

3 and driven to the rest of the device. See Figure 121. Extract the above bits from header 

4 and drive to segmentation. 
5 

6 TAGS 

7 The following Figures show the data formats for each of the supported protocols. 

8 The specifications are not hard coded. The port table will be used for the header's 

9 location. Each field is 8 bits wide with 12 bits VPI and 16 bits VCI field, for a total of 28 

10 bits lookup tag. This 28 bits lookup tag is located in the l" 64-bit word, bits 32 thru 63. 

1 1 See Figures 122 and 123. The PPP protocol is part of the link layer header. The tag is bit 

12 60:41 of the first 64 bits word. 
13 

14 Port Types 

15 The first lookup is to read the port parameters. If the type is "000", then the type 



16 is direct flow. There are no hash or lookup operations. The FID will be extracted from the 

17 incoming traffic itself If the type is "001", then it is a default traffic. The default 

18 parameters are stored in the table itself. There are no hashing operations or lookup in this 

19 case. The third special case is the L2 type. The incoming header will be used to find the 

20 encapsulation header. There will be no hash and lookup operations either. The rest of the 

21 lookup types require a hash and lookup operation. The port table will specify the type of 

22 lookup that the lookup engine needs to perform on the incoming data. The lookup 

23 operation will assign an FID to the data stream or packet or flow, except in the bypass 

24 type. The FID is used only internally. See Figure 124. 
25 

26 Direct Header Format 

27 See Figures 125-7. The SOP and EOP have to be extracted from the header. The 

28 SPI interface will also assert the SOP and EOP signals. The lookup engine will determine 

29 the set that uses and asserts to the segmentation engine from the type of the packet. In 

30 egress mode, the device uses the header set of SOP and EOP signals depending on the 

31 type. For example, if the switch fabric is a packet switch fabric, then the lookup engine 

32 will use the SPI interface set. If the switch fabric is a cell one, then the lookup engine will 

33 use the set parsed out of the header. When the header is stripped, then the number of 

34 valid bytes has to be adjusted accordingly. The data has to be packed in 64 bit transfers to 

35 the segmentation engine. 
36 

37 Special Header Addition 

38 If the type is "010", then the flow will be handled differently. The incoming frame 

39 structure will look like the following: 

40 See Figure 128. The CID and the CLASS will become the new FID of the flow. 

41 The rest of the incoming header will be used as the internal header. If the "SPCL LK" is 
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1 set, then the encapsulation header is added. If it is not set, then the header is extracted and 

2 no special header is added. The memory will have the structure depicted in Figure 129. 

3 The memory location will be read and the length will indicate the number of valid 

4 bytes that have to be appended before the data as a special header. The header can be up 

5 to 16 Bytes long. The rest of the bits such as the EFCI, CLP and the 0AM are driven, too 

6 as out of band signals. See Figure 130. The byte valid from the SPI interface to the 

7 segmentation engine has to be adjusted for the fact that the extra bytes have been added. 

8 The data has to be packed if the number of byte is not on 64 bit boundary. 
9 

10 Hash Index Generation 

1 1 The extracted label is concatenated with the input port to produce the key. Keys 

12 may be padded out to 32 bits if they are less than 32. The key is hashed in hardware 

13 using the irreducible ETHERNET CRC32 polynomial listed below and three irreducible 

14 trinomial polynomials: 

15 1. G(X) = x^' + x2'+x^' + x^'+x^'+x''-hX*^ + x>Vx«+xVx^+xSx'+ 

16 X* + l; 

17 2. G(X) = X^' +X'^+ 1; 

18 3. G(X) = X^*+X^+1; 

19 4. G(X) = X^*+X^+1; 



20 The key is hashed 32 bits at a time, requiring 1 cycle for all keys. See Figure 131. The 

21 type will determine if there is a need for hashing. The tag will be extracted in the first 

22 clock because it is available in the first 64 bits transfer. The key will be formed. The key 

23 will then be hashed using four different hashing functions. The hashing is done in 

24 parallel. The first hashed key or index will be used as a base register to read two memory 

25 locations. As soon as the first location is read, then the key that is stored in that memory 

26 location is compared to the hashing key. If there is a match, then the read operations are 

27 halted. The content of the memory of the matching key will be used. If there is no match, 

28 then the next location content will be compared and so forth. The second index will be 

29 used to read the third and forth records if the first record of two bins fails to get a match. 

30 Then the third and the forth indices are used for the second memory. If all four indices 

31 fail to get a match, then the CAM is accessed. If the CAM contents do not match, then 

32 the traffic is forwarded to the CPU. 
33 

34 Timing Diagrams 



35 See Figure 132. The transfer size is assumed to be 64 bytes for up to eight 

36 transfers for SOP SOB data traffic. 
37 

38 Segmentation 

39 The segmentation engine segments the incoming packets into 52-byte ATM, 48 

40 bytes AAL-5, or switch cells and stores them in memory as cells of 64 bytes. The 

41 segmentation engine adds the necessary number of pad bytes to complete the cell. 
42 

43 Main Features 

44 - Segments incoming data from SPI-4 on per port basis, supports maximum of 

45 64 ports 

46 - Segments incoming data from CPU interface 
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1 - 64K bytes maximum packet 

2 - Segments, maps or de-maps incoming data into either ATM/AAL-5 cells or 

3 switch cells 

4 - Stores segmented cells in memory as cells of 64 bytes 

5 - Supports the traffic depicted in Figure 133. 

6 - Types of segmentation and applications for various traffic types. 

7 - Backpressures the scheduler in the egress side from the per flow Q in the 

8 ingress side via status cells through a serial interface to segmentation engine. 

9 - Calculates CRC-32 for packets segmented into AAL-5 cells. 

10 - Calculates new CRC-32 for L2 header. 

1 1 - Supports progranmiable maximum transfer unit up to 16K bytes for packet 

12 data. 

13 - Each port has a 32-bit counter to count packets coming from that port. 

14 - Each port has a 48-bit counter for data through that port. 

15 - Tags bad packets that they can be discarded later. 

16 - Internal memories to store parameters and data. 

17 o Dual-port SRAM 

18 ■ Control - 65 x 39 bits to store packet length, and cell byte count 

19 ■ Data - 640 x 64 bits to store data 

20 ■ CRC - 65 X 32 bits to store the partial CRC for each port 

21 ■ Statistic - 65 x 80 bits to store packets and data received for 64 

22 ports, and one for CPU 

23 o Single-port SRAM 

24 ■ Buffer - 80 x 7 bits to maintain the free buffer list for data 

25 SRAM 

26 ■ Queue - 80 x 70 bits to queue information for the complete 

27 cells 

28 - Operates at 200 MHz 
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1 

2 Functional Description 

3 See Figure 134. There are three input data paths to the segmentation engine: data 

4 from SPI-4, CPU, or serial interface. There are two output data paths from the 

5 segmentation engine: to memory manager, and serial interface to the scheduler. 

6 See Figure 135. The segmentation engine receives data from the incoming SPI-4 

7 during normal operation. Based on the type of application, the segmentation engine 

8 segments, de-maps, or maps the incoming data to either ATM/AAL-5 or switch cells. 

9 As shown in Figure 133 there could be five different types of segmentation (as opposed 

10 to types of application) performed by the segmentation engine. The following explains 

1 1 the operation of the segmentation engine for each of these types. 
12 

13 TYPE 1 Segmentation 

14 See Figure 136. Type 1 segmentation is for application type 0, 4, 8, 9, 10, 1 1,12, 

15 and 14. The segmentation engine receives the 64-byte cells from the look up engine and 

16 queues the cells to the output control block when the entire cell has been received. 
17 

1 8 TYPE 2 Segmentation 

19 See Figure 137. Type 2 segmentation is for application type 1 and 5. The 

20 segmentation engine receives the 64-byte cells from the look up engine and queues the 

21 cells to the output control block when the entire cell has been received. The cells come in 

22 as 52 bytes ATM data with 12 bytes of pad and output as 48 bytes of AAL5 data with 16 

23 bytes of pad to the memory manager block. The segmentation engine removes the 4 bytes 

24 ATM header and appends 4 bytes of pad to the end of the cell. 
25 

26 TYPE 3 Segmentation 

27 See Figure 138. Type 3 segmentation is for application type 2 and 13. The 

28 segmentation engine receives the 64-byte cells from the look up engine and queues the 

29 cells to the output control block when the entire cell has been received. The cells come in 

30 as 64 bytes packet data and output as 48 bytes of AAL5 data with 16 bytes of pad to the 

3 1 memory manager block. The segmentation engine segments the incoming data into 48 

32 bytes, appends 16 bytes of pad, and passes the control information for the completed cell 

33 to the output control block. CRC-32 is calculated when the cell is being sent out to the 

34 memory manager block. 
35 

36 TYPE 4 Segmentation 

37 See Figure 139. Type 4 segmentation is for application type 3 and 6. The 



38 segmentation engine receives the 64-byte cells from the look up engine and queues the 

39 cells to the output control block when the entire cell has been received. The cells come in 

40 as 64 bytes packet data and output as 64 bytes of AAL5 like data to the memory manager 

41 block. The segmentation engine segments the incoming data into 64 bytes and passes the 

42 control information for the completed cell to the output control block. The segmentation 

43 engine calculates CRC-32 and appends an AAL5 like trailer at the end. However, CRC- 

44 32 is calculated when the cell is being sent out to the memory manager block. 
45 
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1 TYPE 5 Segmentation 

2 The segmentation engine actually is not segmenting data in type 5. When the 

3 ingress segmentation engine receives a status cell from the egress PFQ through the serial 

4 interface, it queues that status cell to the memory manager like any other cells. The status 

5 cell is sent through the switch to the egress segmentation engine, and the egress 

6 segmentation engine sends the status cell to the ingress scheduler through a serial 

7 interface. See Figure 140. The serial input and output data paths provide a way for the 

8 per flow Q on the egress side to back pressure the scheduler on the ingress side from 

9 sending data. 
10 

1 1 CPU injected packets 

12 The CPU can inject data through software command as described later in section 

13 1.6.2. When the segmentation engine detects the CPU inject packet command it sends the 

14 cell out in the next control pop from the memory manager. However, there can only be no 

15 more than one CPU cell sent within 64 cells time. The segmentation engine starts a cell 

16 counter whenever it sends a CPU cell, and no more CPU cells will be queued out until the 

17 counter reaches 64, 

18 As shown in Figure 141, the CPU can write and read the internal registers inside 

19 the segmentation engine through the CPU interface. The CPU can also input packets into 

20 the segmentation engine through the CPU interface. 
21 
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1 

2 Block Description 

3 See Figure 142. 
4 

5 Description of INPUT CONTROL block 

6 In normal operation, when the segmentation engine has room in its FIFO for 

7 incoming cells it asserts a FIFO not full signal to the look up engine and waits for a valid 

8 signal from the look up engine to indicate there is valid control information on the bus. 

9 When there are valid controls, the input control block performs read from various internal 

10 memories: 

1 1 a) Reads from the FBL SRAM to get two pointers to the data SRAM, 

12 b) Uses the port address to read the SEG_TBL SRAM to get the per port parameters. 

13 c) Uses the port address to read the STATISTIC SRAM to get the statistics for that 

14 port, 

15 The per port parameters (SOP, HOP, etc.) from the bus are compared with those from 

16 the SEG_TBL, and error conditions are set accordingly. The port address from the bus is 

17 compared to the CUR_PID, which stores the last port address processed, to see if the 

18 incoming burst is from the same port address. If so, it means the data burst is the 

19 continuation to the one previously processed, and the control information in the internal 

20 registers are used instead of those read from the memories. Otherwise, it means the 

21 incoming data burst is for a new port address and new controls are to be used, and the 

22 controls for the last processed one are to be stored before the new ones are loaded. 

23 Detailed information is discussed later in the timing flow. 

24 The input control block has two sets of control parameters, one for the last processed 

25 port address, i.e. the current parameters, and one for the port address from the look up 

26 engine that comes with the valid signal. 

27 The controls, statistics, and the two pointers are then passed to the data block with a 

28 start signal to start the data state machine. 

29 The input control block also receives FIFO status from the FBL and QUEUE blocks 

30 and generates the FIFO not empty signal to the look up engine. 
31 

32 Description of FBL block 

33 The FBL (Free Buffer List) block keeps track of the open entries in the data 

34 SRAM. After power on reset, the CPU needs to write the init bit in the segmentation 

35 register to start the self- initialization of the free buffer list. The self-initialization takes a 

36 little over 80 cycles, and no data is allowed through the segmentation engine block until 

37 the initialization is done. 

38 The FBL block receives the read strobe from the control block. For each read 

39 strobes the FBL block reads two locations from the FBL SRAM and passes the two 

40 pointers back to the control block. 

41 The FBL block receives the write strobe and the return pointer from the data 

42 block. It writes the return pointer to the write address when there is no read pending. 

43 The FBL block generates a full status to the control block if there are no more free 

44 entries in the data SRAM. 
45 
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1 Description of FBL SRAM block 

2 This is an 80 X 7 bits single-port internal SRAM to store pointers to the free 

3 memory blocks in the data SRAM. The CPU needs to set the init bit to start the self- 

4 initialization after reset. 
5 

6 Description of SEG.TBL SRAM block 

7 This is a 65 X 19 bits single-port internal SRAM to store the parameters for the 64 

8 ports plus CPU. This RAM is accessed twice by the control block for each cell comes in 

9 for a particular port, one read to retrieve port parameters and one write to return 
10 parameters for that port. 

11 

12 Description of STATISTIC SRAM block 

13 This is a 65 X 80 bits dual-port internal SRAM to store number of packets (32 

14 bits) and data bytes (48 bits) received on the 64 ports and CPU interface for statistic 

15 purpose. 
16 

17 Description of DATA PIPELINE block 

18 The DATA PIPELINE block delays the incoming data and byte valid to match the 

19 delay for control signals through the INPUT CONTROL block. 
20 

2 1 Description of DATA block 

22 The data block receives the control information from the input control block and 

23 decodes the types of application to be performed by its input state machine. The input 

24 state machine starts when the data block receives the start signal from the control block. 

25 The input state machine has five different branches based on the segmentation types 

26 decoded from the application types. For each of the branch the input state machine 

27 performs the following operations and writes the data into the data SRAM: 

28 a) Add 8 bytes of pad 

29 b) Add 16 bytes of pad 

30 c) Segment incoming data into 48 bytes and add 16 bytes of pad 

31 d) Segment incoming data into 64 bytes 

32 e) Pass through the 64 bytes 

33 The main function of this input state machine is to accumulate a 64 bytes cell for each 

34 port address. When there is a completed cell, the input state machine generates a control 

35 signal to queue the control information for that particular cell into the QUE FIFO in the 

36 queue block. At the same time, the data block writes the statistics back into the STAT 

37 SRAM and sends the parameters back to the input control block. 

38 There is a data byte counter which increments for each incoming data bytes to count 

39 for the 64 bytes cell. For a cell with EOP in it, the counter value is sent to the memory 

40 manager indicating the number of bytes valid in the EOP cell. 

41 For AAL5 or like data cell, the data block also check to see if a new cell is needed for 

42 the trailer. If needed so, the data block writes the data/trailer to the buffer entry indexed 

43 by the second pointer. Otherwise, the data block returns the second pointer to the end of 

44 the free buffer list. 

45 There is also an output state machine, which starts when the queue block requests to 

46 de-queue from the data block. This output state machine receives the pointer to the data 
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1 SRAM and the control information to be processed on the cell. The output state machine 

2 reads from the data SRAM with the buffer pointer and output the data to the memory 

3 manager. When the data block finish sending data to the memory manager it returns the 

4 pointer back to the free buffer list. 

5 If the control information indicates a need to generate CRC-32, the data block reads 

6 the CRC SRAM with the port address from the control information and load the partial 

7 CRC into the CRC-32 engine. At the end of the cell and not EOF, the data block writes 

8 back the partial CRC into the CRC SRAM. If the control indicates a need to calculate 

9 new CRC for the L2 header, the output state machine replaces the existing CRC bytes 

10 from the packet with the ones calculated with the new L2 header, and the new values are 

1 1 sent through the CRC engine to calculate for the AAL5 trailer. 

12 There are two statistic counters in the data block, one for the number of packet 

13 received and the other for the number of data received. These statistics are kept on a per 

14 port basis, where the data block load the information from the STAT SRAM with the 

15 load signal from the input control block. 
16 

17 Description of DATA SRAM block 

18 This is a 640 x 64 bits dual-port internal SRAM to store the incoming data as the 

19 segmentation engine completes segmenting them into 64-byte cells. The memory is 

20 allocated into 80 blocks with each block has eight entries, total of 64 bytes per block. 
21 

22 Description of CRC_GEN block 

23 This block generates the CRC that covers the PDU (Protocol Data Unit) in the 

24 AAL-5 format. The polynomial used is CRC-32 as follows. 

G(x)=x32 + x2^ + x23^x^-+x'^ + x'2 + x"+x''' + x« + x^ + x5 + x* + x2 + x+l 

26 

27 Since the internal data bus has 64 bits, the CRC-32 is calculated in two portions: 

28 a) CRC-32 engine with 64-bit data input 

29 b) CRC-32 engine with 32-bit data input 

30 From the beginning of the data, CRC-32 engine with 64-bit input data bus is used 

3 1 all the way to the end of the data, i.e. right before the trailer for AAL5. Then it loads the 

32 remainder from the CRC-32 engine with 64-bit input data bus to the CRC-32 engine with 

33 32-bit input data bus and calculates the CRC with the four MSB bytes of AAL5 trailer. 

34 The CRC_GEN block reads the partial CRC with port ID as the address to the CRC 

35 SRAM. The CRC_GEN block loads the partial CRC into the CRC engine and starts 

36 calculating as the data are going out to the memory manager. When the cell is sent to the 

37 memory manager block, the CRC_GEN block writes back the partial CRC into the CRC 

38 SRAM, again using port ID as the address. At the end of a packet, the CRC is muxed out 

39 as part of the AAL-5 trailer. For L2 header, CRC is calculated for the packet and is used 

40 to replace the CRC bytes in the packet before calculating the CRC for the AAL5 trailer. 
41 

42 Description of CRC SRAM block 

43 This is a 65 x 32 bits dual-port internal SRAM to store partial CRC for each port. 

44 The data block reads and loads the partial CRC if the traffic type requires CRC-32 

45 calculation. The data block writes back the partial CRC when the completed cell is sent to 

46 the memory manager. 
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1 

2 Description of OUTPUT CONTROL block 

3 The OUTPUT CONTROL block sends the available signal to the memory 

4 manager to indicate that there are completed cells available to transfer. The memory 

5 manager replies with the control pop signal if it is ready to receive the completed cells. 

6 When there is control pop from the memory manager, the output control block reads the 

7 QUEUE FIFO for the control information for that cell and passes them to the data block. 

8 The data block performs read to the DATA memory as described in the data block above. 

9 The output control block sends the control information lining up with the output data. 

10 The output control has to work in compliance with the global sync, it only asserts the 

1 1 available signal in second clock slot from the global sync (see detailed timing in later 

12 section). The memory manager is to responds in the third clock slot, and the output 

13 control block generates the pop signal to the QUEUE FIFO in the following clock. The 

14 output control block also generates the read signal to the data memory in the same clock 

15 slot. 



16 If there is status cell pending, the output control block sends the status cell using 

17 one of the regular port accesses from the global sync, and the skipped cell takes over the 

1 8 CPU access during that particular global sync. This means CPU access is skipped for the 

19 global sync that has a status cell to send. The output control block returns to its normal 

20 port access in the following global sync. 
21 

22 Description of Q_FIFO SRAM block 

23 This is an 80 x 70 bits single-port internal SRAM to store control information for 

24 cells queued to the memory manager block. 
25 

26 Description of CPUJNTF block 

27 The CPU.INTF block is to interface with the external CPU. It supports CPU 

28 direct access to the internal registers and indirect access to the internal memories. CPU 

29 can also injected packet data through this interface. The CPU can inject packet/data 

30 through use of CPU inject data command, however, it can only injects a 64 bytes cell one 

31 at a time. As soon as there is a complete cell, the CPU_INTF block asserts a signal to the 

32 output control block to output the cell in the CPU access. 



33 

34 Description of PFQ_SEG INTF block 

35 The PFQ_SEG INTF block receives data from the serial interface between per 

36 flow Q and the segmentation engine. The PFQ_SEG INTF block shifts the serial data into 

37 a 64-bit shift register and load the output of the shift register into an internal register. The 

38 control information is extracted from the serial data. When the PFQ_SEG INTF block 

39 receives the entire status cell, it queues the completed cell into the queue FIFO. 
40 

41 Description of SEG_SCH INTF block 

42 The SEG_SCH INTF block receives data from the data block and outputs the data 

43 through a serial interface to the scheduler block. When the segmentation engine receives 

44 a status cell from the incoming SPI-4 interface, the data block outputs the data into the 

45 SEG_SCH INTF block instead of the data SRAM. As soon as the complete cell is 
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received, the SEG_SCH INTF block shifts the cell out the serial interface to the 
scheduler. 

ERROR CONDITIONS 

Several errors can be detected in the segmentation engine, and the error packet 
will be marked error by the SEG_ENGINE. The different errors are in Figure 143. These 
errors are reported in the status register. There are individual bits in the register to mask 
the individual errors from generating interrupt to the CPU. 

Memory Description 

Summary of Internal Memories: 

See Figure 144. 

Data structure for SEG_TABLE 

See Figure 145. The SEG_TABLE is a 65 x 39 bits internal SRAM addressed by 
the port ID (PID). The segmentation table has 65 entries, one for each port and CPU. 
BAD 
SOP 
EOP 

Cell Byte Count 
PTR 

Packet Length 



CLP 
EFCI 

CALC.CRC 
GAM 



This bit is set if detects error condition and needs to discard 
This bit is set if start of packet is received for the port 
This bit is set if end of packet is received for the port 
This field keeps track of the byte count received for the 64-byte cell 
The start address to the cell in the data SRAM 
: This field keeps track of the total length for the incoming packet for 
the port 

Cell Loss Priority, valid only with ATM traffic 
EFCI, valid only with ATM traffic 
Calculate L2 CRC. 
indicate GAM cell. 



Data structure for FREE_BUFFER 

See Figure 146. The FREE_BUFFER is an 80 x 7 bits internal SRAM that used 
to store the pointer to the free data block in the data SRAM. The segmentation engine 
takes a pointer from the top of this table to receive the incoming data. When the memory 
manager gets a complete cell off the queue the pointer to that cell is return to the end of 
this table. 

Data structure for Q_FIFO 

See Figure 147. This 80 x 70 bits internal SRAM that stores the queue of 
complete cells and the information for each queued cell. 
FID 
TYPE 
BAD 
SOP 
EOP 

CELL_LEN 
PTR 



Flow ID of the incoming data for the port. 
Traffic type of the incoming data for the port. 
Mark the packet to be discarded. 
Mark the start of the packet. 
Mark the end of the packet. 

Indicate the number of valid bytes in the 64-byte cell. 
The start address to the cell in the data SRAM. 
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1 PKT_LEN 

2 QOS 

3 0_PORT 

4 CLP 

5 EFCI 

6 CALC.CRC 

7 0AM 



Length of the packet, valid only for packet traffic. 
Quality of Service. 
Output port number. 

Cell Loss Priority, valid only with ATM traffic. 
EFCI, valid only with ATM traffic. 
Calculate L2 CRC. 
indicate OAM cell. 
8 

9 Data structure for DATA SRAM 

10 See Figure 148. The DATA SRAM is to store the incoming data in 64-bytes 

11 block. 
12 

13 Data structure for CRC_TABLE 

14 See Figure 149. The CRC Table is a 65 x 32 bits dual-port internal SRAM 

15 addressed by the port ID (PID). Each port has an entry to this SRAM to store the partial 

16 CRC generated for the AAL-5 traffic. 
17 

18 Data structure for STATISTIC 

19 See Figure 150. The Statistic SRAM is a 65 x 80 bits internal SRAM addressed 

20 by the port ID (PID). Each port has an entry to this SRAM to store the number of packets 

21 and data received for that port. Statistics for the CPU interface is in the last entry. The 

22 CPU interface accesses the content of this SRAM. 

23 PKT_RCVD : Number of packets received for each port 

24 DATA_RCVD : Number of data bytes received for each port 
25 
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1 

2 Interface 

3 

4 Interface Ports 

5 System 

6 See Figure 151. 

7 Segmentation <=> Lookup 

8 See Figure 152. 

9 Segmentation <=> Memory Manager 

10 See Figure 153. 

11 Segmentation O PFQ 

12 See Figure 154. 

13 Segmentation O SCH 

14 See Figure 155. 

15 Segmentation O CPU Interface 

16 See Figure 156. 
17 

18 Timing Diagrams 

19 See Figure 157. 
20 

21 Memory Access 

22 See Figure 158. The bolded "Rd" indicates that there are data dependencies for 

23 these reads. The current PID being processed is compared with the incoming one, the 

24 segmentation uses the current per port parameters if they are equal. Otherwise, it uses 

25 those read from the memories. 
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1 

2 Registers Description 

3 

4 Registers 

5 See Figure 159. 
6 

7 Commands 

8 Sunmiary of commands 

9 See Figure 160. 

10 CPU Inject Packet 

11 See Figure 161. 

12 Read SEGFBL memory 

13 See Figure 162. 

14 Write SEG FBL memory 

15 See Figure 163. 

16 Read SEG CRC memory 

17 See Figure 164. 

1 8 Write SEG CRC memory 

19 See Figure 165. 

20 Read SEG STATISTIC memory 

21 See Figure 166. 

22 Write SEG STATISTIC memory 

23 See Figurel67. 

24 Read SEG QUEUE memory 

25 See Figure 168. 
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1 Write SEG QUEUE memory 

2 See Figure 169. 

3 Read SEG DATA memory 

4 See Figure 170. 

5 Write SEG DATA memory 

6 See Figure 171. 

7 Read SEG TBL memory 

8 See Figure 172. 

9 Write SEG TBL memory 
10 See Figure 173. 
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1 

2 Principles of Operation 

3 The segmentation engine has three different sources for its incoming data: SPI-4, 

4 CPU interface, and status cells from per flow Q. The control information for the complete 

5 cells are queue into the queue FIFO, and are output to the memory manager following 

6 global sync scheme. If there is a status cell from per flow Q, it will be output in the next 

7 available cell slot, and the CPU slot will be used to compensate for the port that its slot is 

8 taken by the status cell. The segmentation engine resumes to the global sync in the 

9 following global sync cycle. 

10 In normal operation, the segmentation gets its data from the SPI-4. If the FIFO 

1 1 full signal is not asserted, it means the segmentation engine is ready and has room in its 

12 FIFO to receive data from the SPI-4. The look up engine then sends the control 

13 information together with the data to the segmentation engine. For the first data cell of a 

14 packet for a particular port the segmentation engine sets the SOP bit for that port in the 

15 segmentation table, writes data into the data FIFO, increments both the packet length and 

16 cell count as data come in. The data state machine keeps receiving data from the look up 

17 engine until the end of data burst, complete cell size, end of packet, or if there is data 

18 coming from the CPU or serial interface. 



19 If it is the end of data burst only, the segmentation engine loads the packet length 

20 and cell count back into the segmentation table for that port. The segmentation engine is 

21 now ready to receive the next data burst. 

22 If the segmentation engine has a complete cell size, it queues the cell by writing 

23 the cell information into the Q_FIFO. The segmentation engine resets the cell count, 

24 loads the packet length back into the segmentation table. Again, the segmentation engine 

25 is now ready to receive the next data burst. 

26 If it is the end of a packet, the segmentation engine checks to see if the traffic type 

27 is for packet to AAL-5. If so, the segmentation engine checks to see if another new cell is 

28 needed for the AAL-5 trailer. The segmentation engine then queues the completed cell(s) 

29 into the Q_FIFO, and then resets the entire entry for that port in the segmentation table. 

30 For packet data to AAL-5 cells, the CRC_GEN calculates the ATM CRC-32 when the 

3 1 cells are going out to the memory manager. The partial CRC is stored on per port basis, 

32 and the partial CRC is loaded back into the CRC generation engine for each cell queued 

33 out the particular port. 

34 The segmentation engine can process multiple open ports. For each port ID it 

35 reads from the segmentation table the parameters for that port, receives data for that port, 

36 stores the parameters back into the table, read the parameters for a different port from the 

37 table and so on. 

38 The segmentation engine queues a completed cell into the Q_FIFO and asserts the 

39 SEG_MEM_AVAILABLE signal to the memory manager block. The memory manager 

40 block pops the Q__FIFO to get the information (FID, PID, etc.), and the segmentation 

41 engine reads the data from the data SRAM using the pointer from the Q_FIFO. The 

42 segmentation engine sends the data to the memory manager with timing according to the 

43 global sync. After the segmentation engine sends the completed cell to the memory 

44 manager, it returns the pointer to the FBL. 

45 If the incoming data is from the CPU interface, the segmentation engine receives 

46 data from the CPU_FIFO, strips the header from the cell and extracts the control 
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1 information from the header. The segmentation engine then waits to complete the cell and 

2 queues the completed cell to the Q_FIFO as in normal operation. 

3 If the incoming data is from the serial interface from per flow Q, the segmentation 

4 shifts the serial data in, packs to 64-bit bus and store in an internal FIFO. The header of 

5 the incoming cell is stripped and control information such as flow ID and traffic type are 

6 extracted from the header. The segmentation engine waits to complete the 64-byte cell 

7 and queues it into the control FIFO (Q_F1F0). When the segmentation engine detects a 

8 status cell as the traffic type coming through the SPI interface it queues the complete cell 

9 into a separate FIFO in the SEG_SCH_INTF block and send the data out in a serial 
10 format to the scheduler. 

11 

1 2 Memory Manager 

13 

14 Features 

15 The Memory Manager (MM) block manages all the data and control traffics 

16 from/to the Segmentation, Per Flow Queue, Reassembly, and external SDRAM (or 

17 DRAM). Input cells (data) are sent from the Segmentation block to the MM block. The 

18 ATM cells and packets are sent to the Memory Manager block in the "internal cell" 

19 format. Internal cell size is determined at the power CPU of the chip (i.e. programmable). 

20 Figure 175 depicts the top level interface of the Internal Memory Manager block with 

21 others. 



22 The Memory Manager block has several main features listed as follows: 

23 1 . Support up to 5 12 Mbytes of external storage Memory using DDR- 

24 SDRAM. 

25 2. Cache the incoming "bypass" cells so that they can be transferred to the 

26 Reassembly block with minimum latency (without going through the 

27 external SDRAM path). 

28 3. Act as a buffer for data going from the Segmentation block to the external 

29 SDRAM. 

30 4. Act as a buffer between the incoming burst from the external SDRAM and 

31 the output to the Reassembly block. 

32 5. Receive Cell Data and Control Information from the Segmentation block 

33 and initiate En-queuing process. 

34 6. Receive De-queuing information (such as BID, FID) from the Per Flow 

35 Queue, get requested data from either the Internal Memory or the external 

36 SDRAM, and then send them to the Reassembly block along with other 

37 related information. 

38 7. Interface with the Free Buffer List (FBL) inside the Per Flow Queue 

39 block: 

40 a. Send En-queuing request to FBL to receive a BID indicating the 

41 En-queuing cell's data location in DDR-SDRAM. 

42 b. Send De-queuing BID to FBL to release the location pointed by 

43 the BID. 

44 8. Manage the processes and orders of reading from the SDRAM and writing 

45 to the SDRAM to maximize the overall bandwidth of the SDRAM. This 
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1 relates to the strategy of selection of banks of memory to read and write to 

2 reduce the overhead penalty. 

3 9. CPU Interface: All memories of the Memory Manager can be accessed 

4 (read/write) by the microprocessor through the CPU interface: 

5 a. Internal Memory 

6 b. CAM, Internal Memory List (IML), Free Internal Memory List 

7 (FIML) 

8 c. external SDRAM (by Queuing CPU read/write requests) 

9 10. Frequencies: 200 Mhz 
10 See Figure 174. 

11 

12 Functional Description 

13 Refer to Figure 174 for the top level block diagram of the Memory Manager 

14 (MM) block and Figure 175 for the block diagram of the Internal Memory Manager 

15 (IMM). The incoming cells are sent to the Internal Memory Manager block by the 

16 Segmentation block along with other information for the En-queuing process. After 

17 getting the En-queuing BE) from the Free Buffer List block, the IMM synchronizes the 

18 BID with all the information from the Segmentation block and then send them to the Per 

19 Flow Queue block for En-queuing. 

20 Incoming cells are temporarily stored in the Internal Memory (internal SRAM) 

2 1 that is controlled by the Internal Write Manager block. If cells are specified as "bypass" 

22 indicated by the Type input, they stay in the Internal Memory and then they are sent to 

23 the Reassembly block directly during the De-queuing process. If incoming cells are 

24 specified as "non-bypass" indicated by the Type input, they are temporarily stored in the 

25 Internal Memory, and then moved to the external SDRAM. The moving of all "non- 
26 bypass" data is handled by processing all the SDRAM write requests with En-queuing 

27 BID/IMBID inside the SDRAM Write Request Queues block. During the De-queuing 

28 process, if some of these cells are called to be de-queued (i.e. CAM Hit) before they are 

29 sent to the SDRAM, their status will be changed in the CAM and they will stay in the 

30 Internal Memory instead of moving out to the SDRAM. These cells will be sent to the 

31 Reassembly block directly. Right after a cell is taken out of the Internal Memory, the 

32 CAM and the Free Internal Buffer List are Updated immediately. 

33 During the De-queuing process, the Per Flow Queue interfaces with the Internal 

34 Memory Manager block with a De-queuing BID. The IMM will check in the CAM to see 

35 whether the requested cell (data) is inside the Internal Memory (SRAM) or inside the 

36 external SDRAM. If CAM indicates CAM Hit and the requested data is a "bypass" type, 

37 it means that the requested cell is inside the Internal Memory. In that case, the Memory 

38 ID Manager (MIM) will issue a command to the Internal Read Manager to read the CAM 

39 hit data out of the Internal Memory; the MBID (internal buffer ID) will be provided to 

40 the Internal Read Manager. The MM also modifies the CAM table so that this specific 

41 data will not go to the external SDRAM (it will be marked as "bypass") and it signals the 

42 De-Queue Request Manager block not to send a read SDRAM request to the SDRAM 

43 Read Request Queues block. 

44 During the De-queuing process, if the De-queuing BID can not be found inside 

45 the CAM, it means that the requested data is inside the external SDRAM. In this case 

46 MM will signal the De-Queue Request Manager block to send a read SDRAM request to 
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1 the SDRAM Read Request Queues block with the BE) and MBID (provided by the 

2 MIM block from the Free Internal Buffer List). The SDRAM Read Manager block takes 

3 read requests from the SDRAM Read Request Queues and processes them to through the 

4 SDRAM Manager (outside of the Internal Memory Manager block). Read-back data from 

5 the SDRAM will go through the SDRAM Read Manager block with the data itself, the 

6 BID and the EVBID assigned to it previously. The data and IMBID will go to the Internal 

7 Write Manager which will open the gate for the data to be stored inside the Internal 

8 Memory block at the IMBID location. As data enters into the Internal Memory, the CAM 

9 table will also be Updated properly (it will be marked as "bypass"). The Internal Read 

10 Manager then will schedule the data to go out to the Reassembly properly based on the 

1 1 order of the De-queue requests came in previously (stored in the De-queue Queue block). 

12 Microprocessor can access (read/write) the external SDRAM through the CPU 

13 data bus for testing and diagnostic purposes. The access can be random and data are not 

14 managed or shaped. The CPU Interface block inside the Memory Manager sends read or 

15 write requests from/to the SDRAM Read Request Queues and the SDRAM Write 

16 Request Queues, respectively. The CPUID/BID will be queued along with the 

17 IMBID/BID. The SDRAM Read Manager and the SDRAM Write Manager will send 

18 these information and associated data to the SDRAM BW Optimizer based on the bank 

19 selection strategy inside it. Data coming back from the SDRAM is routed properly to the 

20 CPU Interface block based on the returned CPUBID/BID information. 
21 

22 Block Description 

23 

24 Internal Memory Manager 



25 Refer to Figure 175 for the block diagram of the Internal Memory Manager. 

26 

27 Internal Memory Block 

28 This is an internal dual-port SRAM memory. It can run at 200 Mhz with 5 ns 

29 access time. The input data bus width is 128 bits and the output data bus width is also 128 

30 bits. The input data, IMBID and control signals are provided by the Internal Write 

3 1 Manager block. The output data go to the Internal Read Manager in response to the read 

32 request along with IMBID provided by the Internal Read Manager block. 

33 The size of the SRAM is currently defined as 64 Kbytes (= 4K x 128 = IK of 64- 

34 byte cells). 
35 

36 Memory ID Manager (MIM) Block 

37 This block consists of a CAM, a Free Internal Memory List (FIML), an Internal 

38 Memory List (IML), and a MM (Memory ID Manager) Controller. Refer to Figure 3 and 

39 Figure 4 to understand the mechanism of the CAM, FIML, and the IML blocks. FIML 

40 and IML are internal SRAMs. The CAM contains the BID values. The MIM Controller 

41 block manages all the incoming conmiands to Update the CAM, FIML, and IML blocks. 
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1 It also arbitrates all the Update commands from different blocks as they can happen at the 

2 same time. 
3 

4 Free Internal Memory List (FIML): 

5 This list contains IK entries (i.e. to support 1024 cells inside the Internal Memory 

6 (SSRAM)). The content of each entry is the IMBID that is an address points to the 

7 Internal Memory List (IML) to get IMBID and status. It also corresponds to the output 

8 address of the CAM. The FIML contents are initialized from 0 to 1023 when an 

9 initialization command is received from the CPU interface. 

10 The FIML has 2 pointers: TakePtr and RetumPtr. TakePtr and RetumPtr are 

1 1 initialized at the top of the List, RetumPtr points to location 0 and TakePtr points to 

12 location 1 . TakePtr points to a free IMBID value. TakePtr is incremented by 1 every time 

13 an IMBID is taken. RetumPtr points to a not-available IMBID value. Every time a 

14 location is returned to the FIML, a new (i.e. returned) IMBID value will be written into 

15 the location that RetumPtr currently points to. Now that location represents a free cell 

16 with the returned IMBED points to a free cell inside the Internal Memory. RetumPtr is 

17 also incremented by 1 every time an IMBID is retumed. 

18 When TakePtr and RetumPtr reach above the value of 1023, they will be rolled 

19 over to the value of 00. TakePtr and RetumPtr will never be equal. If (TakePtr - 

20 RetumPtr) is equal to -1, then the IM is full and the IM_Full bit of the 

2 1 Mem_Status_0_reg will be set. If (TakePtr - RetumPtr) is equal to 1 , then the IM is 

22 empty and the IM_Empty bit of the Mem_Status_0_reg will be set. The status of Full-l is 

23 also recorded in the IM_Full_l bit. 

24 The CPU interface should be able to write and read to/from the FIML memory: 

25 1). CPU_Wr_FIML{ input: Adr, Wr_value } : 

26 This command allows the CPU interface to write to the FIML table where Adr is the 

27 address value (range from 0 to 1023) and Wr_value is the content that we are going to 

28 write to. Refer to Register Description section. 

29 2). CPU_Rd_FIML{ input: Adr; output: Rd_value } : 

30 This command will read the content of the FIML table at location Adr. Refer to Register 

31 Description section. 
32 

33 Internal Memory List (IML): 

34 The IMBID value is the address of the IML block. The content of each entry 

35 contains 3 bits: the valid bit, bypass bit, and readback bit. 

36 ■ Valid: this bit indicates that at IMBID location in the Internal Memory, a 

37 cell is valid. 

38 ■ Bypass: IMBID points to a cell that is whether a bypass or non-bypass 

39 type. A bypass type cell will not go into the external SDRAM, it is ready 

40 inside the IM to go out to the Reassembly. A non-bypass type cell is in 

41 transition to go into the external SDRAM. 

42 ■ Readback: indicates that this cell is from the SDRAM and ready to go out 

43 to the Reassembly block. 



44 

45 CAM: 
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1 CAM contains the values of BE)s (external SDRAM pointers) that are inside the 

2 Internal Memory. If a CAM hit occurs, an IMBID value will be returned by CAM. After 

3 receiving the IMBID, the MIM will go to the IML and read the content at IMBID. This 

4 content contains IMBID (Internal Memory address) and its status. If valid bit is equal to 

5 1, it means that IMBID is valid and, therefore, this CAM hit is valid. Otherwise, a "miss" 

6 is returned. 

7 There are 4 main internal commands (between blocks and not CPU commands) 

8 can be issued to the MM block (from other blocks): 

9 I). Take_ID <r IMBID: 

10 In this command, the MIM will access the FIML and take a free IMBID from it. The 

1 1 output of the command will be a free IMBID that represents a free cell inside the Internal 

12 Memory. The TakePtr will be incremented by 1 after it releases a free cell. 

13 2). Retum_ID{ input: IMBID}: 

14 In this command, the MIM will receive an IMBID as an input. It will return this pointer 

15 to the FML. The RetumPtr will be incremented by 1 after an IMBID is returned. 

16 3). Compare_CAM{ input: BID; output: IMBID, status, hit/miss}: 

17 In this command, input argument is BID and output is IMBID, status (valid bit, bypass 

18 bit, busy bit) and hit/miss bit. The MIM receives BID and compares it to the entries in the 

19 table. If a match is found, the CAM will output the corresponding IMBID. The MIM goes 

20 to the IML and read its content at the location IMBID. The MIM also checks to see if the 

21 "valid" bit is equal to 1 at this location. If it is equal to 0 then a miss has occurred, 

22 otherwise, it is a CAM hit. The content of IML at IMBID will include status, and a 

23 hit/miss bit. 

24 4). Wr_CAM{ input: IMBID, BID, status}: 

25 This command will modify the content of the CAM table and the content of the IML 

26 table with the new values of BID and status, respectively. 

27 5). Rd_CAM{ input: IMBID; output: BID, status } 

28 This command reads the contents of the CAM table and the IML table. 
29 

30 The CPU interface should be able to write and read to/from the CAM table: 

31 1). CPU_Wr_CAM{ input: IMBID, BID, status}: 

32 This command allows the CPU interface to write to the CAM table and the IML table 

33 with new values of BID, and status, respectively. 

34 2). CPU_Rd_CAM{ input: MBK); output: BID, status } : 

35 This command will read the content of CAM table (i.e. BID) and the content of IML 

36 table (i.e. status) 
37 

3 8 Internal Write Manager Block 

39 This block receives input from the Segmentation block and the SDRAM Read 

40 Manager, and manages the writing process into the Internal Memory (SRAM) block. 

41 There are 4 main functions that this block handles independently: 
42 

43 Function 1: (Segmentation^ Internal Memory). 
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1 1) Segmentation asserts Seg_Mem_Available signal to the MM block to let it know that 

2 there is data available in its fifo. 

3 2) MM block sends Mem_Seg_Cntl_Pop signal to pop the cell control information of a 

4 cell. 

5 3) Cell control information is sent to the MM block. 

6 4) At a fixed delay from the time that IWM gets the cell control information from the 

7 Segmentation block, the Segmentation block starts to send cell data to the MM block 

8 at 128-bit word at 10ns period (every other cycle of 200 Mhz clock). The 128-bit data 

9 word will be first latched into flip-flops inside the IWM. 

10 5) From FIML, get free IMBID: issue Take_ID command to MIM block. 

1 1 6) Write data into the Internal Memory: Wr_IM { IMBID; Seg_Data } . 

12 7) Update CAM with new BID: Wr_CAM{IMBID; BID, valid=l,bypass=0/l, 

13 readback=0}. 

14 8) If (not "bypass"), 

15 a. Then put new BID/IMBID into end of SDRAM Write Request Queues: 

16 Wr_SWRQ{ end of queue; BID/IMBID } . 
17 

1 8 Note: This block has a small FIFO to receive data inputs from the Segmentation block. 

19 The size of the FIFO is 4 64-byte cells with 128 bit wide. Data will be stored in this IWM 

20 FIFO before it is sent to the Internal Memory. 
21 

22 Function 2: (Segmentation-^ Per Flow Queue). 

23 1) IWM sends the cell control information to the PFQ block with a command specified 

24 in Mem_PFQ_Com signals (i.e. "00": invalid; "01": enqueuing; "10": discard; 

25 "ir':release (for dequeuing)). 

26 2) PFQ receives the information, processes it and then 

27 a. If command = "0 1 ", then it sends back en-queue BID (Buffer ID) to the MM 

28 block 8 cycles later. This BID will be used as an address to write data into the 

29 external SDRAM. If BID = "7FFFFF', then that particular cell will be 

30 dropped and not written into the SDRAM. 

31 b. If command = "10", then it sends BID = "7FFFFF' back to the MM block. In 

32 this case, data will not be written into the external SDRAM. 



33 

34 Function 3: (Control data path to Internal Memory). 

35 Inputs and outputs of the Internal Memory are time shared as follows (to handle dual-port 

36 128-bit wide SRAM at 200 Mhz): 

37 1) Slot 0 (period = 5ns): input from Segmentation. 

38 2) Slot 1 (period = 5ns): input from SDRAM Read Manager. 
39 

40 Function 4: Microprocessor (CPU) Access. 



41 When switched to this mode, the microprocessor can write to the Internal 

42 Memory (IM) with a specified address. During this mode, there is only the CPU interface 

43 block that can access (read/write) to the IM block. Refer to the Register Description 

44 section.. 

45 CPU_Wr_IM{ input: Adr, Wr_value} : write to IM at the Adr location with the 

46 value of Wr_value. 



65 



AZA-OOl/2001-POO 




PATENT 



1 

2 SDRAM Read Manager Block 

3 This block receives read requests from the SDRAM Read Request Queues and 

4 transfers those requests with data to the SDRAM Manager. It receives read-back data 

5 from the SDRAM Manager and sends it to the Internal Memory. 
6 

7 Function: (SDRAM-> Internal Memory). 

8 1) SDRAM Read Manager sends data with BID and MBID to the Internal Write 

9 Manager: Rd_SRRQ{ top of queue } <= BID, IMBID, SDRAM_Data. 

10 2) Request the Internal Write Manager block to write data into Internal Memory: 

1 1 Wr_IM{ IMBID; SDRAM_data} . 

12 3) Update CAM with new BID (send a command to the MIM): Wr_CAM{ IMBID; BID, 

13 valid=l, bypass=l, readback=l }. 
14 

1 5 Internal Read Manager Block 

16 This block receives input from the Internal Memory and manages its output to the 

17 Reassembly block and the SDRAM Write Manager. 
18 

19 Function 1: (Internal Memory ^ Reassembly). 

20 1) Look into CAM to see if BID from top of De-queue Queue matches with BID in 

21 CAM. Issue a command to MIM: Compare_CAM{BID; IMBID, status, hit/miss}. If 

22 hit/miss = 1, get IMBID as output from CAM. Put IMBID into the output queue 

23 inside the IRM block. Assert Mem_Ras_Available to the Reassembly block. 

24 2) The Reassembly block asserts the Ras_Mem_Cntl_Pop signal to the MM block. The 

25 IRM sends the cell control information at the top of the queue by popping from top of 

26 the De-queue Queue block. 

27 3) At a specified time slot the Reassembly block asserts the Ras_Mem_Data_Pop signal 

28 to the MM block. The IRM starts to read data from the IM, Rd_IM{ IMBID} <= 

29 Reassembly_Data. 

30 4) Send data to the Reassembly block. Pop IMBID from the output queue. 

31 5) If(REL=l)or(REL=0 and "bypass" = 1 and "readback"=l) 

32 Then return IMBID to Free Internal Memory List: Return ID (IMBID}. 

33 6) Update CAM: Wr_C AM{ IMBED; BID= 00, valid=0, bypass=0,readback=0 } . 

34 7) If(REL = Dthen 

35 Release BID to FBI in PFO: assert Mem PFO Valid hieh. Mem PFO Com = 

36 "II " (release BID) and Mem PFO Param = BID. 

37 (Refer to Multicast and BID/IMBID Release section) 
38 

39 Function 2: (Control data path to Reassembly and SDRAM). 

40 Inputs and outputs of the Internal Memory are time shared as follows (to handle a dual- 

41 port 128-bit wide SRAM at 200 Mhz): 
42 
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1 1. Slot 1 (period = 5ns): output to Reassembly. 

2 2. Slot 2 (period = 5ns): output to SDRAM Write Manager. 
3 

4 Function 3: Microprocessor Access. 

5 When switched to this mode, the microprocessor can read from the Internal Memory (IM) 

6 at a specified address. During this mode, there is only the CPU interface block that can 

7 access (read/write) to the IM block. 

8 CPU_Rd_IM{ input: Adr; output: Rd_value } : read from IM at the Adr location with the 

9 value of Rd_value. 
10 

1 1 SDRAM Write Manager Block 



12 This block receives write requests from the SRAM Write Request Queues and 

13 then gets data from the Internal Memory to send to the SDRAM Manager block. 
14 

15 Operation: (Internal Memory SDRAM). 

16 1) Get BID and IMBID from the top of SDRAM Write Request Queues (refer to 

17 SDRAM read/write strategy section for more details on algorithm to select proper 

18 banks of memory to read and to write). 

19 2) Look into MIM to check status of that BID: Rd_CAM{ IMBID; BID, status } . 

20 3) If "bypass" = 1, 

21 Then do not write data to the SDRAM. Return BID to FBL. Go to last 

22 step (5). 

23 Else if "bypass" =0, then get data from the Internal Memory: Rd_IM{ IMBID} <= 

24 IM_Data. Send IM_Data to the SDRAM Manager along with BID and IMBID. 

25 4) Return IMBID to the Free Internal Memory List and Update CAM: 

26 Retum_ID{IMBID} 

27 Wr_CAM{ IMBID; BID=00, valid=0, bypass=O,readback=0 } . 

28 5) Pop the BID and IMBID from the top of SDRAM Write Request Queues. 
29 

30 De-queue Request Manager Block 

3 1 This block receives De-queuing requests from the Per Flow Queue block with 

32 other related information. It will look into the CAM to see if there is a CAM hit. If it is, it 

33 will modify CAM to reflect its new status, that is now a "bypass" data and should not be 

34 written into the SDRAM. It then lets the Internal Read Manager take data out of the 

35 Internal Memory and send it to the Reassembly block. If there is no CAM hit, it will send 

36 a read request to the SDRAM Read Request Queues to get data out from the SDRAM. 
37 

38 Function: (Per Flow Queue Internal Memory, SDRAM). 

39 a. Store BID, PortID and other cell control related information for De- 

40 queuing process into the end of De-queue Queue. 

41 b. Look into CAM to see if BID is matched with BID in the CAM. 

42 Compare_CAM{BID; IMBID, status, hit/miss} 
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1 c. If BID is found in CAM (hit/miss = 1) and "readback" = 0 (not outgoing 

2 cell) 

3 If "bypass" = 0 

4 Then get IMBID. Modify CAM: Wr_CAM{IMBID; BID, valid=l, 

5 bypass= 1 , readback=0 } . 

6 Else if no CAM hit, 

7 Then get free IMBID from Free Internal Memory List: Take_ID. Write 

8 BID and IMBID into end of SDRAM Read Request Queue. 
9 

10 Multicast and BID/IMBID Release 

1 1 During Dequeuing the PFQ sends to the MM block a command to indicate 

12 whether to release the Dequeue BID to the Free Buffer List (in the PFQ) or not to release 

13 the Dequeue BID due to Multicast. Depeiiding on the type of data inside the Internal 

14 Memory, the IMBID is also released to the Free Internal Buffer List (FIBL) in the MIM 

15 block. Figures 176-178 depicts the mechanism of the release command: 
16 

17 SDRAM Manager (SM) 

18 See Figure 179. The DDR_SDRAM Controller (SM) receives the memory write 

19 (enqueue) and read (dequeue) requests from the Internal Memory Manager (IMM). All 

20 operations are done on a per controller basis independent of the other controller. Each of 

21 the two SDRAM Manager blocks, SMO and SMI consists of: 

22 • RD ID FIFO 

23 • WR ID FIFO 

24 • BANDWIDTH OPTIMIZER BLOCK 

25 • REQUEST SERVICE BLOCK 

26 • CONTROLLER INTERFACE BLOCK 

27 • BUFID ADDR DECODE BLOCK 

28 • DENALI DDR_SDRAM MEMORY CONTROLLER BLOCK 

29 SMO and SMI blocks each control corresponding pair of SDRAMs. It uses 128-bit 

30 data bus at 200 MHz in order to support 2X OC-192 transfer rate (20 Gbps). Each 

3 1 DDR_SDRAM Controller has the capability of peak data transfers at 16 bytes (or 128 

32 bits) per 5ns or 25.6 Gbps for a total memory bandwidth of 5 1 .2 Gbps in the 

33 DDR_SDRAM Manager (SM). Therefore, the DDR_SDRAM Manager must 

34 simultaneously process two memory accesses, one each to SMO and SMI in order to 

35 sustain the required bandwidth. 

36 The BANDWIDTH OPTIMIZER block decides which of 8 possible operations, 

37 4 reads from each bank and 4 Writes to each bank to service for optimal data throughput 

38 from the SDRAMs. It looks at the current state which is the last processed request and 

39 the outstanding requests in the per bank queues and picks one based on an algorithm 

40 detailed below to optimize the data throughput from/to the SDRAMs. Some of the 

41 avoided situations include sequential same bank operations with their bank activate and 

42 precharge delays, read followed by write commands with the associated pitfall of turn 

43 around cycle delays, etc. 
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1 The Bandwidth Optimizer Block will scan through the 8 queues, 4 reads and 4 

2 writes per Memory Manager, DM0 and DM1, and select an operation request to the 

3 appropriate bank based on the SDRAM ReadAVrite Strategy discussed below. 

4 • Each series of Read requests to external memory will have to follow by a Write 

5 request to a different bank of memory from the accessed bank of the last Read. 

6 • Within a series of Read requests, consecutive Reads should have different banks. 

7 • Within a series of Write requests, consecutive Writes should have different banks. 

8 For Example: 

9 Rd(bO) ^Rd(bl) -^Rdm ->Rd(b3) ^ Wr(bO) ^ Wr(bl) ^ Wr(b3) ... 

10 At the same time the read and write request BID/IMBID are stored in the READ 

1 1 ID FIFO and WRITE ID FIFO respectively to be returned to the IMM with the read 

12 data or a write request confirmation respectively. For write request, IMBID is returned so 

13 that IMM can update the CAM and release IMBID to Free Internal Memory List. For 

14 read request, BID/IMBID are returned so that IMM can store the read data to IM and 

15 release BID to Free Buffer List. 

16 The REQUEST SERVICE BLOCK is activated with the particular request to be 

17 serviced by the Bandwidth Optimizer block. It routes the data back and forth 

18 appropriately to and from the dram and communicates with the Address Decode block 

19 and the Memory Controller Interface block. It also handles the back pressure from the 

20 Denali Memory Controller in case the conunand queues or the data queues in that block 

2 1 have reached their limit. 

22 The BUFFERID ADDRESS DECODE BLOCK does the address translation of 

23 the BID to the absolute byte address and feeds this to the Memory Controller Interface 

24 block via the Request Service Block. 

25 The CONTROLLER INTERFACE BLOCK formats all the commands for the 

26 Denali Memory Controller. It processes the address and data part and sends them to the 

27 Memory Controller along with required control signals in the format expected by the 

28 Memory Controller. It also receives the data from the memory controller for read 

29 operations and sends it to the Request Service Block. It will pass on control signals from 

30 the Controller to the Request Service block signaling that the controller command or data 

31 queue is full and when it is ready to receive the next request 

32 From the above block onwards the interface of the DENALI MEMORY 

33 CONTROLLER and the SDRAMs is detailed in the diagram below. The Memory 

34 Controller takes in the Address and Data from the Memory Controller Interface Block 

35 and formats them into appropriate RAS and CAS commands for the DRAM memories. It 

36 also does the refresh of the SDRAM memories. 

37 See Figure 180, 

38 CPU Interface 

39 Assumption: CPU access requests are infrequent and are for debugging and testing 

40 purpose only. 

41 SM services the CPU request along with the en-queuing and de-queuing requests 

42 with equal priority. Whenever CPU requests an access to DDR-SDRAMs, SM will 

43 process the request at the end of one round scanning and servicing the 8 queues of the 

44 read/write requests. In order not to interrupt the bandwidth for enqueuing and dequeuing. 
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1 each CPU access will be serviced after X number of rounds of servicing the read and 

2 write queues. (The X number is programmable through a register) 

3 Since the en-queuing cells and de-queuing cells are 64-byte and hence the DDR 

4 SDRAMs are programmed to have burst length of 4 (64x8 = 4x 128), the CPU data aceess 

5 should be in the same cell unit, 64-byte. 
6 

7 Memory Description 

8 

9 SunMnary of Memories: 
10 See Figure 181. 

11 

12 Summary of CAM: 

13 See Figure 182. 

14 Interface 
15 

16 Segmentation Engine 

17 See Figure 183. 
18 

19 Per Flow Queue Engine 

20 See Figure 184. 
21 

22 DDR-SDRAM 

23 See Figure 185. 
24 

25 Reassembly 

26 See Figure 186. 
27 

28 CPU Interface 

29 See Figure 187. 
30 

31 Global Sync 

32 See Figure 188. 
33 

34 Interface with DDR-SDRAM 

35 The Memory Manager can support up to 5 12 Mbytes of external memories using 

36 DDR-SDRAM. The DDR-SDRAM device that is available at this time is the IM x 32bit 

37 X 4banks (128 Mbits). The memory size can be programmed using register 

38 Mem_Config[2:0] (refer to the Register section). 

39 2 DDR-SDRAM devices are required to support a data bus of 64bit. Hence 2 data 

40 strobes, Mem_SDR_DQS_Lx) and Mem_SDR_DQS_Hi, are used to for the lower and 

41 upper 32bit of the data bus respectively. 

42 I A duplication set (set 1) of memory address and control signals (including 

43 memory clock) is available for each controller in order to improve its signal driving 

44 capability when the number of devices are increasing (up to 16 devices per controller). 

45 Hence each interfacing signals will not drive more than 8 devices in any configuration of 

46 memory sizes. 
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1 This second set of address and control pins can be activated by programming the 

2 register Mem_Config[3]. 
3 

4 Example 1: 

5 Refer to Figure 1 89 for the configuration of 64 Mbytes of memory. 

6 Mem_Config[3] = 0. 
7 

8 Examyle 2: 

9 Refer to Figure 190 for the configuration of 256 Mbytes of memory. 
10 Mem_Config[3] = 1. 

11 

12 Register Description 

13 See Figure 191. 
14 

15 Commands 

16 CPU_Wr_SDRAM 

17 See Figure 192. 

18 CPU_Rd_SDRAM 

19 See Figure 193. 

20 CPU_Wr_ FIML 

21 See Figurel94. 

22 CPU_Rd_FIML 

23 See Figure 195. 

24 CPU_Wr_CAM 

25 See Figure 196. 

26 CPU_Rd_CAM 

27 See Figure 197. 

28 CPU_Init_MM 

29 See Figure 198. 



30 

31 Principle of Operation 

32 
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1 Initialization Setup 

2 At power-on-reset, the entire block is reset by MEM_Reset_L signal. After 

3 power-on-reset, all registers are at default values. 

4 CPU can issue the CPU_Init_MM command (refer to the "Register section") that 

5 will go through proper steps to initialize all internal memories inside the Memory 

6 Manager block. 
7 

8 Operational Flows 

9 

10 Normal Mode: 

1 1 The normal mode is the default mode. The Memory Manager handles both the 

12 enqueuing and the dequeuing processes. 
13 

14 Enqueue: 
15 

16 Segmentation to MM: 

17 1 . Segmentation block has data available for MM block, it starts to assert 

18 Seg_Mem_Available signal. 

19 2. MM block asserts Mem_Seg_Cntl_Pop signal to get cell control information from 

20 Segmentation block. 

21 3. At a specified time slot the Segmentation block sends cell control information to the 

22 MM block. 

23 4. The MM block stores cell control information in the IWM block. 

24 5. At a specified time slot the Segmentation block starts to send data to the MM block 

25 through the Data bus (128 bits) in 4 clock cycles at 10 ns period (every other clock 

26 tick of the 200 Mhz clock). 

27 6. The IWM block stores data inside the IWM FIFO (4 64-byte cells). 

28 7. After receiving the free BID from PFQ (see below), the IWM block write appropriate 

29 data from the IWM FIFO into the Internal Memory (IM). 
30 

31 

32 MM to PFQ: 

33 The IWM block sends cell control information to the PFQ block on the 

34 Mem_PFQ_Param along with a command via the Mem_PFQ_Com bus. The 

35 Mem_PFQ_Valid signal is asserted high to let the PFQ block knows that the data bus and 

36 command bus contains valid data. If the Discard signal is not asserted then the 

37 Mem_PFQ_Com value should be 01 (enqueue). . If the Discard signal is asserted then the 

38 Mem_PFQ_Com value should be 10 (discard). 

39 1 . If it is an enqueue command, the PFQ block sends back to the MM block a BID at a 

40 specified number of clocks later via the PFQ_Mem_Param bus.This BID is used as an 

41 address to write data into the external SDRAM. The PFQ_Mem_VaIid signal is 

42 asserted and the PFQ_Mem_Com value is 01 . 

43 2. IWM block receives BID and tags it along with the associated data being stored in the 

44 IWM FIFO. 
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1 3. If it is a discard conunand, the PFQ block sends back a BID with value of "7FFFFF' 

2 to the MM block at that specified time slot. In this case, data from the Write FIFO 

3 will not be written inside the external SDRAM (it will be flushed out of the FIFO). 

4 4. If the PFQ decides to drop the cell (even when the command is not a discard), it will 

5 send to the MM block a BID with value of "7FFFFF'. In this case, data from the 

6 Write FIFO will not be written inside the external SDRAM (it will be flushed out of 

7 the FIFO). 
8 

9 MM to SDRAM: 

10 

11 1 . Data from the Segmentation is temporarily stored in the IWM FIFO. 

12 2. After receiving a free and valid BID from the PFQ, the IWM block requests a free 

13 IMBID from the Memory ID Manager (MIM) block. The CAM is also updated 

14 properly with the new IMBID and BID. 

15 3. IWM writes data from the IWM FIFO to the Internal Memory (IM) block at address 

16 IMBID. 

17 4. If the cell is not a "bypass" cell, then IWM sends BID and IMBID to the SDRAM 

18 Write Request Queue (SWRQ) block. Place the information at the bottom of the 

19 queue that associates with memory bank of the BID address. 

20 5. The SDRAM Write Manager (SWM) block goes through the "Write Bank Selection 

21 Algorithm" and it selects a BID/IMBID pair from a particular queue inside the 

22 SWRQ block. The SWRQ sends that IMBID to the MIM block for status verification. 

23 6. If that cell is marked as a "bypass" cell (this means that it is being dequeued and 

24 ready to go out to the Reassembly block), then do not write data to the SDRAM. Pop 

25 the BID/IMBID from the top of the queue inside the SWRQ block. 

26 7. If that cell is not valid as indicated in the status (this means that it has gone out to the 

27 Reassembly block), then do not write data to the SDRAM. Pop the BID/IMBID from 

28 the top of the queue inside the SWRQ block. 

29 8. If that cell is marked as a "non-bypass" cell, then 

30 a). The SWM block sends request to the IRM block with BID/IMBID. 

31 b). IRM reads data from the IM block (Rd_IM{IMBID}) and send this data to the 

32 SDRAM Manager block along with BID/IMBID. 

33 c). Return IMBID to the MEM block and update its CAM: 

34 - Retum_ID{IMBID} 

35 - Wr_CAM{ IMBID; BID=00, valid=0, bypass=0, readback=0 } 

36 - Pop BID/IMBID from the top of the queue inside the SWRQ block 

37 Dequeue: 
38 

39 PFQ to MM: 

40 1 . At a specified time slot, PFQ sends to the Read Request Queue & Manager (RRQM) 

41 block dequeued cell control information that includes a BID via the Data2 bus. Both 

42 Valid2 and Deq signals are asserted to let the MM block knows that the data bus is 

43 valid and contains dequeue information. 

44 2. The RRQM block stores cell control information at the end of the queue. 

45 3. The RRQM block sends a BID from the top of the queue with a read request to the 

46 Memory Read/Write Controller (MRWC). 
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1 

2 SRAM to MM: 

3 1 . The MRWC block monitors the Read FIFO. If it is not full, the MRWC sends a 

4 dequeue BID to the SRAM Interface (SI) block with a read command ("10"). 

5 2. The SI block puts the BID and the command in a queue. 

6 3. The SI block executes a read command to the external SRAM with the BID as an 

7 address. Data is read out from the external SSRAM and sends to the Read FIFO. The 

8 associated BID will be placed inside the Read FIFO along with the data. 

9 4. When a complete cell is read back from the SSRAM, the Read FIFO sends its status 

10 and the associated BID to the MRWC block. 

11 5. The MRWC block sends the BID back to the PFQ block via the MM_PFQ_Data bus. 

12 The Valid is asserted and the Com value is "1 1" (dequeue). 

13 6. PFQ receives the BID and releases it back to the Free Buffer List block. 
14 

1 5 MM to Reassembly : 

16 1 . The Read Request Queue & Manager (RRQM) block monitors the Read FIFO status. 

17 2. Whenever there is a complete cell available inside the Read FIFO, the RRQM sends 

18 the MM_Ras_Available to the Reassembly block. 

19 3. The Reassembly block asserts the CntLPop signal to the MM block. The RRQM 

20 block sends the cell control information at the top of the queue to the Reassembly 

21 block at a specified time slot. 

22 4. At a specified time slot the Reassembly block asserts the Data_Pop signal to the MM 

23 block. The Read FIFO starts to send data out to the Reassembly block at a specified 

24 time slot via the 128-bit Data bus at 10 ns period (every other clock tick of the 200 

25 Mhz clock). 
26 
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1 Flow charts for Functional Operations 

2 See Figures 199-205. 
3 

4 Reassembly 

5 The Reassembly Engine is a state machine that reassembles the stored switch cells 

6 into outgoing ATM, AAL-5, packets, or switch cells. 

7 Main Features 

8 - Can provide a maximum of IM total flows. 

9 - Uses PER PORT queues for re-assembly. 

10 - Reassembles segmented cells back into IP packets. These packets were 

1 1 segmented into cells for switching at the ingress side and need to be 

12 reassembled back 

13 - Maximum packet size of 64 KB can be handled 

14 - AAL5 trailer field check for length and CRC-32 for all flows that have to be 

15 re-assembled and sent out as packets. 

16 - L2 CRC check for packets that have the L2 CRC check enabled. 

17 - Can strip up to 16 bytes of Header data. 

18 - Reassembles 48 byte AAL5 cells or 52 byte ATM cells, both stored as 64 byte 

19 memory cells into outgoing packets, ATM cells or switch cells. 

20 - No lookup, pass-through traffic 

21 - For ATM 

22 o VPI, VCI header translation 

23 o Operation, Administration and Maintenance (OAM), Cell Loss Priority 

24 (CLP) and Explicit Forward Congestion Indication (EFCI) replacement 

25 o Mark End of Packet in Payload Type Indicator (PTI) Field for AAL5 

26 Egress Cells 

27 o Optional MPLS tag append 

28 o Encapsulate raw cells into packets for routing through packet fabrics. 

29 o Encapsulate ATM cells into packets for routing through packet fabric. 

30 - MPLS lookup 

31 o POS (PPP/IP) packet format 

32 o MPLS push/pop 

33 o MPLS header translation 

34 o Decrement TTL and drop if TTL is one before decrementing 

35 - It has one re-assembly going on for each of the ports that are conFigured and 

36 active, supports a maximum of 64 ports plus a CPU port. 

37 - The output of this block goes to the PHY PORT. 
38 

39 
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1 - Supports the traffic represented in Figure 206. 

2 - Applications for various traffic types 

3 - Types of reassembly: 

4 o TYPE 1 : Ingress TM (ATM, AAL5, Packets) to 64B Switch Cells 

5 o TYPE 2: Ingress ATM cells to Packet Data 

6 o TYPES: Ingress A AL5 cells to Packet Data 

7 o TYPE 4: Ingress Packets to Packet Data 

8 o TYPE 5: Egress (ATM) to ATM cells with Translation 

9 o TYPE 6: Egress (AAL5) to Packet Data 

10 o TYPE 7: Egress (Packets) to AAL5 Cells 

11 o TYPES: Egress (Packets) to Packet Data 
12 

13 - Calculates and checks the CRC-32 for AAL-5 reassembly packets 

14 - Each port has a 32-bit counter to count packets/cells transmitting from that 

15 port 

16 - Each port has a 48-bit counter to count number of data bytes transmitted 

17 through that port 

18 - Each port has a 16-bit counter to count packet CRC errors. 

19 - Each port has a 16-bit counter to count TTL Timeout errors. 

20 - Internal memory to store control and data for the 64 ports 

21 o Dual -port SRAM 

22 ■ CPU Data - 32 x 64 bits to store CPU data 

23 ■ Data - 1024 x 64 bits to store Cell data 

24 ■ Header - 256 x 64 bits to store Header data 

25 ■ Output Port Control - 128 x 21 bits to store Output Port Control 

26 information and Next Pointer links. 

27 ■ Output Port Queue - 64 x 24 bits to store Output Port Queue 

28 Pointers to the Data cells 

29 ■ CRC - 64 X 32 bits to store the partial CRC for each port 

30 o Single-port SRAM 

31 ■ Reassembly Table - 64 x 18 bits to store packet length, SOP and 

32 EOP 

33 ■ Free Buffer - 128 x 10 bits to maintain the free buffer list for data 

34 SRAM 

35 ■ Statistic - 64 x 96 bits to store packets, data and errors received for 

36 each port 

37 - Operates at 200 MHz 
38 

39 Functional Description 

40 See Figure 207. The Reassembly Engine reassembles the outgoing data from 

41 cells stored internally into ATM cells, Switch cells, or Packet data based on the 

42 application. 

43 As shown in Figure 206 there could be 8 different types of reassembly. The 

44 operation of the Reassembly Engine for each of these types is explained in detail in the 

45 following sections. 
46 
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1 TYPE 1 Reassembly - Ingress Memory cells => Switch cells 

2 Type 1 reassembly maps the 64-byte cells in Memory into switch cells for 

3 transmission by adding a Switch Header. This Reassembly type handles Application 

4 types 0-3 & 7. Figure 2 shows the one to one mapping of ATM Memory cells into switch 

5 cells. The switch header will be 8 or 16 bytes long with the required cell fabric header 

6 being first and our 4-byte header following with the remaining bytes being Pad. Type 1 

7 Reassembly also informs the SPI interface of the number of valid bytes in each 8-byte 

8 word and which Cell is the SOP and which Cell is the HOP for Packet traffic. For AAL5 

9 cells the cells were was stored in memory as AAL5 switch cells with the AAL5 trailer at 

10 the end and Pad to fill the cell to 64 Bytes. For Packet traffic the cells were stored in 

1 1 memory as switch cells with an AAL5 like trailer added to pass on the packet length 

12 information. Figure 3 shows the AAL5 encapsulated cells and Figure 4 shows the pseudo 

13 AAL5 encapsulated packets. For Ingress TM types 0 & 1 that transfer ATM cells the 

14 0AM, EFCI and CLP bits will be inserted in our 4 byte switch header but for Ingress TM 

15 packets, types 2 & 3, the bits will be set to zero. 

16 See Figures 208-10. 
17 

18 TYPE 2 Reassembly - 52 Byte ATM cells => Packet Data 

19 Type 2 Reassembly maps the 52-byte ATM cells stored as 64 byte cells in 

20 Memory into packets and adds a packet header before sending it on to the SPI interface. 

21 Only one ATM cell will be mapped into each packet. Eight of the 12 bytes of Pad data at 

22 the end of each cell will be removed before the packet is sent out. This will allow us to 

23 send out a 64-byte packet including 8-bytes of header. If the Switch Packet Header is 16- 

24 bytes then the packet will be 72-bytes long. The 0AM, EFCI and CLP bits will be 

25 passed in our 4 bytes of the Switch Packet Header. 

26 See Figure 211. 
27 
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1 

2 TYPE 3 Reassembly - 48 Byte ATM cells => Packet Data 

3 Type 3 Reassembly maps the 48-byte ATM cells stored in Memory as 64 byte 

4 cells into packets by stripping the 16 bytes of PAD from each cell and then optionally 

5 stripping the L2 Packet Header and adding a Switch Packet Header before sending it on 

6 to the SPI interface. It must check the L2 CRC to ensure the packets integrity before 

7 sending on to the switch fabric as it also checks the CRC in the AAL5 Trailer and 

8 removes the AAL5 encapsulation. The 0AM, EFCI and CLP bits will be passed in our 4 

9 bytes of the Switch Packet Header. It must check the L2 CRC to ensure the packets 

10 integrity before sending it to the switch fabric as the L2 CRC being sent out is now 

1 1 incorrect due to the removal of the L2 Header and must be replaced by the Egress chip 

12 when it adds a new 12 Packet Header. The Lx)okup engine will replace the L2 Header 

13 and the Segmentation will calculate the new CRC. 



14 See Figure 212. 

15 

16 TYPE 4 Reassembly - Switch cells => Packet Data 

17 Type 4 Reassembly maps the AAL5 like Switch cells stored in memory into 

18 packets removes the AAL5 encapsulation and may add an 8 or 16-byte Switch Packet 

19 header. This can also operate without a lookup operation. (Traffic Shaper, Packet in, 

20 Packet out) 

21 See Figure 213. 
22 

23 TYPE 5 Reassembly - 52 Byte ATM cells + PAD => ATM cells with Translation 

24 Type 5 Reassembly translates the ATM Header by replacing the 28-bit VPI/VCI 

25 then replaces the 0AM, CLP & EFCI bits before the cell is sent out. The extra 12 Bytes 

26 of Pad data from the end of each ATM cell stored as 64-byte cells in Memory is stripped 

27 before sending it out as a 52-byte ATM cell. Using the data byte enables of the SPI 

28 interface strips the extra Pad. 

29 See Figure 214. 
30 

31 TYPE 6 Reassembly - 48 Byte ATM cells => Packet Data 

32 Type 6 Reassembly maps the 48-byte AAL5 encapsulated ATM cells stored in 

33 Memory as 64-byte cells into packets by stripping the 16 bytes of PAD from each cell 

34 and then performing a MPLS operation based on the contents of the Flow ID look-up 

35 table before sending it on to the SPI interface. It also checks the CRC in the AAL5 

36 Trailer before removing the AAL5 encapsulation. If the MPLS TTL is one before being 

37 decremented the packet will be dropped in Reassembly. 

38 See Figure 215. 
39 

40 TYPE 7 Reassembly - 48 Byte ATM cells => AAL5 cells 

41 Type 7 reassembly adds the 4 byte ATM Header along with 4 bytes of PAD, so it 

42 is 8 byte word aligned, to the 48-byte AAL5 encapsulated ATM cells stored in Memory 

43 as 64-byte cells, strips the 16 bytes of Pad, replaces the 0AM, CLP & EFCI bits in the 

44 ATM Header and then transmits them as AAL-5 cells. The 4 bytes of alignment PAD 

45 will be stripped using the byte enables when transmitting to the SPI. The EOP AAL-5 

46 cell is marked using the PTI field in the ATM Header. 
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1 See Figure 216. 

2 

3 TYPE 8 Reassembly - Switch cells => Packet Data 

4 Type 8 Reassembly maps the AAL5 like Switch cells stored in memory into 

5 packets and performs a MPLS operation based on the contents of the Flow ID look-up 

6 table. It also checks the CRC in the AAL5 Trailer before removing the AAL5 

7 encapsulation. If the MPLS TTL is one before being decremented the packet will be 

8 dropped in Reassembly. 

9 See Figure 217. 
10 

11 Block Descriptions 

12 See Figure 218. 
13 

14 Description of INPUT/ENQUEUE„SM Block 

1 5 The CTRL.SM block receives PID, FID, TYPE, EOP, SOP, CLP, EFCI and 



16 MEM_RAS_AVAILABLE from the Memory Manager. When AVAILABLE is asserted 

17 the CTRL_SM pops the control information from the Memory Manager control FIFO. 

18 The state machine then reads from the control SRAM using PID as the address to retrieve 

19 parameters for that port. The control state machine also requests a pointer to a free buffer 

20 from the Free Buffer List (FBL). Then it loads the parameters into the internal registers 

21 and starts the DATA_SM. The type of Reassembly is decoded from the TYPE field by 

22 the control state machine and the enable signal for each type is passed to the data state 

23 machine. The SOP and EOP signals inform the Reassembly engine when the first and 

24 last cells of a packet are received. 



25 The FID is used to read the external Header Look-Up SRAM to retrieve the 

26 appropriate header and MPLS information for that Flow Id. This Header is passed on to 

27 the DATA_SM to be added to the reassembled cell. 

28 The PID and other control signals are read from the Memory Manager by the 

29 Reassembly engine 1 clock before the data to give the control state machine enough time 

30 to read the per port parameters from the SRAM and load into internal registers, retrieve a 

3 1 free buffer pointer from the FBL and to start the header read from external memory. 
32 

33 Description of OUTPUT/DEQUEUE.SM Block 

34 The data state machine starts when it receives the start signal from the control 

35 state machine. Once started, the data state machine reads 64-bytes of data from the 

36 Memory Manager data FIFO. The data state machine will also determine from the 

37 TYPE field whether to check the CRC. 

38 When a complete cell is read and the data state machine has added the appropriate 

39 header information obtained from the external Look-Up SRAM, the data state machine 

40 informs the control state machine that it is finished reading the data for that cell. The 

41 control state machine then stores the per-port parameters back in the reassembly control 

42 table. The data state machine is then ready to receive more data. 

43 "Once the next cell has been received and determined not to be an EOP cell then 

44 the previous cell for that port is queued into the Q_FIFO for transmission to the SPL If 

45 the next cell is an EOP then the whole cell must be read and the length determined from 



79 



AZA-OOl/2001-POO 




PATENT 



1 the AAL5 trailer to ensure the proper amount of valid data in the previous cell before it is 

2 queued for transmission. 

3 Once the data for a cell has been stored and the control information is queued in 

4 the Q_FIFO, the Reassembly asserts READY to the SPI interface so it can come and Pop 

5 the Q_FIFO for the control information for that cell. Once the Q_FIFO has been Popped 

6 by the SPI, the Reassembly will start reading the data associated with that control 

7 information and send it on to the SPI 64 bits per 200 Mhz clock cycle until the complete 

8 cell is transmitted. A Data_Valid signal will be active when the data is valid on the bus 

9 and 8 byte enable signals will inform the SPI which bytes have valid data in each 64 bit 

10 transfer. This operation will allow the SPI to control the flow of data by controlling 

1 1 when the Q_FIFO is Popped and also strip the extra pad bytes when necessary. 

12 The data state machine has two types of counters, one for operation and one for 

13 statistic. The cell byte count and packet length counters are for operation. The packet 

14 received count and data received count and CRC errors are for statistic. All counters are 

15 per port basis. 
16 

17 Description of CPU I/F Block 

18 This is a standardized interface to the CPU in the system. It will be instantiated 

19 within each block that requires read/write access to configuration registers or internal and 

20 external memory arrays. During system operation the reassembly engine will use this 

21 interface for writing configuration registers and the header lookup table in external 

22 SSRAM as well as to receive status and statistic information. The CPU will also receive 

23 Data Cells or Packets through this interface using the CPU Port Buffer (port 65). 
24 

25 Description of CPU_PORT Buffer Block 

26 This is a 32 X 64 bit internal Dual Port SRAM to store the incoming data on a per 

27 memory cell basis such that the Reassembly Engine can reassemble the data into the 

28 proper cell configuration before sending it to the CPU. The memory is organized as 4 

29 64-byte buffers to allow 4 cells for the CPU port to be stored. 
30 

3 1 Description of REASM_TABLE_SRAM Block 

32 This is a 64 X 18 bit dual-port internal SRAM to store the parameters for the 64 

33 ports. This RAM is accessed twice (1 read and 1 write) by the control state machine for 

34 each data cell that is received for a particular port. 
35 

36 Description of FBL Block 

37 See Figure 219. This is a 128 x 10 bit single-port internal SRAM to store the free 

38 pointers to the cells in the data SRAM. This list needs to be initialized by software after 

39 reset to have the free pointers. The control state machine requests a free pointer by 

40 reading the list, and the data state machine returns a pointer by writing back to the end of 

41 the list when the SPI receives a complete cell from the Reassembly block. 
42 

43 Description of CRC_GEN Block 

44 This block generates the CRC that covers the PDU (Protocol Data Unit) in the 

45 AAL-5 format. The polynomial used is CRC-32 as shown in Figure 220. There is a 

46 minimum of 3 clocks delay in generating the CRC because the internal data path is 64 
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1 bits and the CRC polynomial is calculated 32 bits at a time. The CRC is generated in 2 

2 stages, first the upper 32 bits and then the lower 32 bits. 

3 The reassembly engine calculates CRC-32 for the AAL-5 data. The CRC_GEN 

4 block reads the partial CRC with port K) as the address to the CRC SRAM. The 

5 CRC_GEN block loads the partial CRC into the CRC engine and starts calculating as the 

6 data is coming in from the memory manager. When the complete cell is loaded in the 

7 internal SRAM the CRC_GEN block writes back the partial CRC into the CRC SRAM, 

8 again using port ID as the address. At the end of a packet, the CRC is compared to the 

9 CRC in the AAL-5 trailer and the packet is marked as bad when sent to the SPI out 
10 interface if the compare fails. 

11 

12 Description of CRC_TBL Block 

13 This is a 64 X 32 bit dual-port internal SRAM to store the partial CRC for each 

14 port. The data state machine controls the read or write to this table based on the traffic 

15 type. 
16 

17 Description of DATA.PIPE Block 

18 The DATA_PIPE block delays the data to the Data SRAM so that it matches the 

19 delay to lookup and inserts the appropriate header information and generates and check 

20 the CRC-32 for all Packet reassembly. 
21 

22 Description of DATA_SRAM Block 

23 This is a 1024 x 64 bit internal Dual Port SRAM to store the incoming data on a 

24 per memory cell basis such that the Reassembly Engine can reassemble the data into the 

25 proper cell configuration before sending it to the SPI. The memory is organized as 128 

26 64-byte buffers to allow 2 cells for each port to be stored. 
27 

28 Description of HEADER_SRAM Block 

29 This is a 256 x 64 bit internal Dual Port SRAM to store the Header data on a per 

30 memory cell basis such that the Reassembly Engine can reassemble the data into the 

3 1 proper cell configuration and attach a programmable Header to it before sending it to the 

32 SPI. The memory is organized as 128 16-byte buffers to allow 2 16-byte Headers for 

33 each port to be stored. 
34 

35 Description of OUTPUT PORT QUEUE Block 

36 This is a 64 X 30 bit single-port internal SRAM to queue information of the 

37 complete cells. The data state machine queues the cell control information when a 

38 complete cell is ready to send to the SPI interface. The Reassembly Engine sends an 

39 Available signal to the SPIO when the Q_FIFO is not empty telling the SPIO that at least 

40 one cell of data is available for transmission. 

41 Control information delineates the SOP, EOP and SOB as well as the Port 

42 Number and whether the packet data had a CRC error or another error condition 

43 occurred. The FIFO also holds the 10-bit pointer to the data cell so the Reassembly will 

44 send the associated data with that control information. 
45 
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1 Description of OUTPUT CONTROL_SRAM Block 

2 This is a 64 X 30 bit single-port internal SRAM to queue information of the 

3 complete cells. The data state machine queues the cell control information when a 

4 complete cell is ready to send to the SPI interface. The Reassembly Engine sends an 

5 Available signal to the SPIO when the Q^FIFO is not empty telling the SPIO that at least 

6 one cell of data is available for transmission. 

7 Control information delineates the SOP, EOP and SOB as well as the Port 

8 Number and whether the packet data had a CRC error or another error condition 

9 occurred. The FIFO also holds the 10-bit pointer to the data cell so the Reassembly will 
10 send the associated data with that control information. 

11 

1 2 Description of External Look-Up Memory 

13 This is a 2M X 72 bit external SRAM to store the Header control information per 

14 FLOWE). This includes the appropriately programmed new header for the particular 

15 FLOW or the necessary MPLS manipulation control information for that FLOW. This 

16 can handle IM FLOWs with 16 Bytes of header and 2 bytes of control information per 

17 FLOW. 
18 

19 Description of MPLS Label Manipulation 

20 The FLOW-ID Lookup table will contain the MPLS label operation instructions 

21 and new labels. Three bits in the first control word will contain the encoded instructions 

22 for what MPLS operation to perform. The operations will be either replace the MPLS 

23 tag, Push the MPLS tag once or twice or Pop the MPLS tag once or twice. After each 

24 MPLS operation the original MPLS TTL value will be decremented. If the TTL value is 

25 one before being decremented then the packet must be dropped. This will be done by not 

26 enqueuing the packet into the output Queue FIFO and subsequently dropping the 

27 following cells from the FLOW until the EOP is reached. 

28 If two labels are being pushed then Label 1 will be pushed on first and Label 2 

29 will be pushed on second. If one label is pushed label 1 will be used. If two labels are 

30 popped the third label will be the one remaining on the top and if one label is popped then 

3 1 the second label will be remaining on top. 

32 The TTL will be decremented as follows: 

33 o Replace MPLS Tag - Decrement MPLS TTL 

34 o Push 1 MPLS Tag - Decrement original MPLS TTL 

35 o Push 2 MPLS Tags - Decrement original MPLS TTL 

36 o Pop 1 MPLS Tag - Decrement original MPLS TTL 

37 o Pop 2 MPLS Tags - Decrement original MPLS TTL 
38 

39 TAGS 

40 Figures 221 and 222 show the data formats for each of the supported protocols. 

41 The specifications are not hard coded. The port table will be used for the header's 

42 location. Each field is 8 bits wide with 12 bits VPI and 16 bits VCI field, for a total of 28 

43 bits lookup tag. This 28 bit lookup tag is located in the 1'* 64-bit word, bits 32 thru 63. 

44 The PPP protocol is part of the link layer header. The tag is bit 60:41 of the first 64 bits 

45 word. 
46 
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1 ATM TYPES 

2 If the type of the traffic is ATM, then the CLP, EFCI, and 0AM bits are included 

3 in the Switch Header for Ingress operation and inserted into the ATM Header for Egress 

4 operation. See Figure 223. 
5 

6 Description of Memories 

7 There is one external Memory interface and 10 internal memory blocks for the 

8 Reassembly Engine. There is a 64 x 1 8 bit single port SRAM for a Reassembly Table 

9 SRAM, a 128 X 18 bit dual-port SRAM for Output Port Control information, a 64 x 18 bit 

10 dual-port SRAM for Output Port Queue Pointer information, a 128 x 7 bit single port 

1 1 SRAM for a Free Buffer list, a 1024 x 64 bit dual-port SRAM for DATA SRAM, a 256 x 

12 64 bit dual-port SRAM for HEADER DATA, a 36 x 64 bit dual-port SRAM for CPU 

13 DATA, two 64 x 32 bit dual-port SRAMs for CRC Tables and a 64 x 1 12 bit single port 

14 SRAM for Statistic information. 
15 

16 Data Structure for Reassembly Table 

17 



18 See Figure 224. The REASSEMBLY_TABLE is a 64 x 18 bit internal SRAM 

19 addressed by the port ID (PID). The Reassembly Table has 64 entries, one for each port. 

20 o SOP: Start of Packet 

21 o EOP: End of Packet 

22 o Packet Length: A running count of the number of bytes in that packet. 
23 

24 Data Structure for OUTPUT_CONTROL_MEMORY 

25 See Figure 225. The OUTPUT_CONTROL_MEMORY is a 128 x 18 bit internal 

26 SRAM that is used to store the control information for each cell that is ready to send to 

27 the SPI block along with the next read pointer and next number of Header words for the 

28 next cell linked to that port in the DATA SRAM and the number of valid bytes in that 

29 cell. 

30 o SOP: Start of Packet 

31 o EOP: End of Packet 

32 o CELL_LENGTH: Number of Data bytes vaHd for that cell. Used 

33 internally to create the Byte valid signals to the SPI interface. 

34 (Combine with number of Header words for 

35 RAS_SPI_CELL_LENGTH sent to SPI OUT.) 

36 o NEXTRD_PTR: Pointer to the Head of the next cell of data queued 

37 to that port in the Data SRAM. 

38 o NEXTHDR_WD: Number of header bytes for the next cell of data 

39 queued to that port in the Data SRAM. 
40 

41 Data Structure for OUTPUT_PORT_QUEUE 

42 See Figure 226. The OUTPUT_PORT_QUEUE is a 64 x 18 bit internal SRAM 

43 that is used to store the output port Queue Head (RD_PTR) and Queue Tail (WR_PTR) 

44 pointers for each cell that is ready to send to the SPI block along with the port Empty 

45 status and the number of Header words in the Queue Head cell. 
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1 o RD_PTR: Pointer to the Head of the next cell for that port that should 

2 be sent to the SPI Out. 

3 o WR_PTR: Pointer to the Head of the last cell for that port that should 

4 be sent to the SPI Out. 

5 o E: Empty - Status of cells queued and linked for that port. (Default = 

6 1, Empty) 

7 o PEND: Cell Pending - Cell is waiting to be queued to output port. 

8 (Default = 0, Not Pending) 

9 o HDR_WDS: Number of header words for the next cell for that port to 

10 be sent to the SPI Out. 

11 o 

12 Data Structure for FREE_BUFFER 

13 See Figure 227. The FREE_BUFFER is a 128 x 7 bit internal SRAM that is used 

14 to store the pointer to the free data block in the DATA SRAM. A pointer is taken from 

15 the top of this table when a data block is needed to receive the incoming data. When the 

16 SPI interface gets a complete cell off the queue the pointer to that cell is returned to the 

17 end of this table. 

18 o FREEPTR: Pointer to the head of a 64-byte buffer in the Data 

19 SRAM. 
20 

2 1 Data Structure for DAT A_SRAM 

22 See Figure 228. The DATA_SRAM is used to store the data while reassembling 

23 it into the proper cells for transmission. There are 128 64-byte buffers. This can support 

24 128 64-byte cells. 
25 

26 Data Structure for HEADER_SRAM 

27 See Figure 229. The DATA_SRAM is used to store the data while reassembling 

28 it into the proper cells for transmission. There are 128 16-byte buffers. This can support 

29 128 16-byte Headers. 
30 

3 1 Data Structure for CPU.DAT A_SRAM 

32 See Figure 230. The CPU_DATA_SRAM is used to store cell data destined for 

33 the CPU. There are 4 72-byte buffers. This can support 4 64-byte Cells of Data plus 8 

34 bytes of control information. 
35 

36 Data Structure for CRC_TABLE 

37 See Figure 23 1 . The CRC_TABLE is a 64 x 32 bit internal dual port SRAM that 

38 stores the partial CRC for each port reassembly. 

39 o PART_CRC: Partial CRC stored after each cell for that port. Used to 

40 calculate the CRC for an entire packet. 
41 

42 Data Structure for STATISTICS 

43 See Figure 232. The Statistic SRAM is a 64 x 1 12 bit internal SRAM addressed 

44 by the port ID (PID). Each port has an entry to this SRAM to store the number of packets 

45 and data received for that port along with the number of CRC error packets. The content 

46 of this SRAM is accessed through the PCI interface block. 
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1 o PKT_RCVD: Counter for the number of Packets received. 

2 o DATA_RCVD: Counter for the number ofData bytes received. Only 

3 includes valid data bytes. 

4 o CRC_ERRORS: Counter for the number of CRC Errors received. 

5 o TTL_ERRORS: Counter for the number of TTL Timeout Errors. 

6 

7 Data Structure for EXTERNAL LOOK-UP MEMORY 

8 See Figure 233. The External Look-Up SRAM is 4M x 36 bits made up of eight 



9 5 1 2K X 36 bit external SRAMs addressed by the FLOW ID (FID). Each FLOW has two 

10 entries to this SRAM to store the Header data for that Flow. The Flow Header Data 

1 1 (FHD) is two or four 36-bit words that contain up to 2 MPLS tags, 1 VPWCI label, or a 

12 Layer 2 Header. On the Ingress chip a total of 12 bytes are available for new user 

13 defined headers with the remaining 4 bytes of Header data used for a Maximus internally 

14 defined Switch Header. On the Egress chip all 16 bytes are available for new user 

15 defined headers as our Maximus 4 byte header will not be append in this case. The two 

16 control bytes contain a number of fields that indicate how to process the packet or cell. 

17 Each FHD is linked to a user defined Header, an MPLS tag or a VPWCI label through 

18 theFLOWID. 
19 

20 FLOW HEADERCFORMAt: 



21 See Figure 234. The following fields are in the Flow Header Data Table: 

22 1 . 1 Byte Control word in 1^' FLOWID word to determines the type of FHD 

23 (MPLS, ATM or L2) and the MPLS operation to perform as well as the 

24 number of Header words. 

25 2. 32 ATM/32 MPLS bit New Label 1 

26 3. 32-bit MPLS New Label 2 

27 4. 1 Byte Control word in 2"" FLOWID word to determine the number of L2 

28 Header bytes to be stripped. 

29 5. 32 bits for possible Longer Header Formats. 

30 6. 32 bits for our internal Switch Header information. 
31 

32 CONTROL BYTE FX3RMAT: 

33 See Figure 235. The first control word is bits 7 1 to 64 of the first 72-bit word of the 

34 FLOW ID with the second control word being bits 7 1 to 64 of the second 72-bit word for 

35 that FLOW ED. The following fields are in the Flow Header Control Words: 

36 1 . HDR TYPE = 2 bits that determines the type of Header (MPLS, ATM or 

37 Ethernet) 

38 a. 00 = MPLS 

39 b. 10 = ATM 

40 c. 01=L2 

41 d. 11= Reserved 

42 2. MPLS OPERATION = 3 bits to indicate the type of MPLS processing to 

43 perform. 

44 e. 000 = Replace MPLS Tag; Decrement MPLS TTL 

45 f. 001 = Push 1 MPLS Tag; Decrement original MPLS TTL 

46 g. 010 = Push 2 MPLS Tags; Decrement original MPLS TTL 



85 



AZA-001/2001-P00^ ^ PATENT 

1 h. 01 1 = Pop 1 MPLS Tag; Decrement original MPLS TTL 

2 i. 100 = Pop 2 MPLS Tags; Decrement original MPLS TTL 

3 j. The last 3 cases are reserved for future implementations. 

4 3. HDR = 2 bits to indicate the number of valid Header words. (00 = 0, 01=1, 

5 1X=2) 

6 4. L2EN = L2 CRC checking Enabled 

7 5. # BYTES L2 HDR STRIP = 4 bits for Number of L2 Header Bytes to strip if 

8 type is ETHERNET. 

9 6. RSVD = 4 bits Reserved for future use. 
10 

1 1 MAXIMUS 4 BYTE HEADER FORMAT: 

12 See Figure 236. The Ingress Maximus chip will append a proprietary 4-byte: header 

13 onto the end of any 8 or 16 byte switch or packet header before sending the data out. 

14 This allows the user to append a programmable 4 to 12 byte header along with the 

15 Maximus 4-byte header onto every packet or switch cell that is transmitted from the 

16 Ingress Maximus chip. The Maximus header contains information of Flow ID, 

17 application types and Class, along with the 0AM, CLP and EFCI bits for ATM cells. 

18 o FID: FLOW ID 

19 o TYPE: Application Type 

20 o EFCI: Explicit Forward Congestion Indication 

21 o CLP: Cell Loss Priority 

22 o OAM: Operation, Administration and Maintenance 

23 o CLASS: Class of service 

24 o EOP: End Of Packet 

25 o SOP: Start Of Packet 

26 The FID and CLASS will be obtained from the Header Lookup memory while the 

27 remaining signals will be obtained from the Memory Manager interface. 
28 

29 ATM HEADER FORMAT: 

30 See Figure 237. 

31 o VPI: Virtual Path Identifier 

32 o VCI: Virtual Channel Identifier 

33 o PT: PayloadType 

34 - EOP: End of Packet 

35 - EFCI: Explicit Forward Congestion Indication 

36 - MI: Management Information 

37 o CLP: Cell Loss Priority 

38 o HEC: Header Error Control 
39 

40 MPLS HEADER FORMAT: 

41 See Figure 238. 
42 

43 AAL5 TRAILER FORMAT: 

44 See Figure 239. 

45 o CPCS-UU: Common Part Convergence Sublayer 

46 o CPI: Common Part Indicator 
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1 o L: Length of payload 

2 o CRC: Error detection in SSCS PDU (user data) 
3 

4 Interface 

5 System 

6 See Figure 240. 

7 Reassembly O Memory Manager 

8 See Figure 241. 

9 Reassembly O SPI 

10 See Figure 242. 

11 Reassembly <»CPU 

12 See Figure 243. 
13 

14 TIMING 

15 See Figures 244-6. 
16 

17 Memories 

18 Input phase to RAS from Internal Memory Manager. See Figure 247. Note: R2 

19 for the Partial CRC Table is dependent on the PID. If the next cell has the same PED as 

20 the previous cell then this read information is not used. 
21 

22 ERROR CONDITIONS 

23 Several errors can happen in the Reassembly engine with the error packets being 

24 marked during the EOP transfer to the SPI as EOP Abort. The different errors are listed 

25 in Figure 248. 
26 

27 Register Description 

28 

29 Registers 

30 See Figure 249. 

31 Commands: 

32 See Figure 250. 
33 

34 Commands 

35 

36 Read external RAS memory 

37 See Figure 251. 
38 



39 Write external RAS memory 

40 See Figure 252. 
41 
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1 FLOW HEADER FORMAT: 

2 See Figure 253. The following fields are in the Flow Header Data Table: 

3 7. 1 Byte Control word in 1'' FLOWID word to determines the type of FHD 

4 (MPLS, ATM or Ethernet) and the MPLS operation to perform as well as the 

5 number of Header words. 

6 8. 32 ATM/32 MPLS bit New Label 1 

7 9. 32-bit MPLS New Label 2 

8 10. 1 Byte Control word in 2"** FLOWID word to determine the number of 

9 Ethernet L2 Header bytes to be stripped. 

10 11.32 bits for possible Longer Header Formats. 

1 1 12. 32 bits for our internal Switch Header information. 
12 

13 Read Internal Data Memory 

14 See Figure 254. 
15 

16 Write Internal Data Memory 

17 See Figure 255. 
18 

1 9 Read Four Internal Memories (Partial CRCl, Partial CRC2, Reassembly Table, 

20 Output Queue) 

21 See Figure 256. 
22 

23 Write Four Internal Memories (Partial CRCl, Partial CRC2, Reassembly Table, 

24 Output Queue) 

25 See Figure 257. 
26 

27 Read PORT Statistics 

28 See Figure 258. 
29 

30 Write PORT Statistics 

31 See Figure 259. 
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1 

2 Read CPU Data Memory 

3 See Figure 260. 
4 

5 Write CPU Data Memory 

6 See Figure 261. 
7 

8 Read 2 Internal Memories (Free Buffer List (FBL), Output Control) 

9 See Figure 262. 
10 

1 1 Write 2 Internal Memories (Free Buffer List (FBL), Output Control) 

12 See Figure 263. 
13 

14 Read Internal Header Data Memory 

15 See Figure 264. 



16 

1 7 Write Internal Header Data Memory 

18 See Figure 265. 
19 

20 Get CPU Cell command 

21 See Figure 266. This command reads 64 bytes from the CPU Data FIFO 8 bytes 

22 at a time and stores them in the general purpose registers RO- R15. There is no address 

23 associated with this command as it just reads the CPU Data FIFO 8 times and then loads 

24 the Registers with the data. 
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1 

2 Initialize Free Buffer List and Output Port Queue Memories command 

3 See Figure 267. This command initializes the Free Buffer List in the FBL 

4 Memory and sets the Empty and Pending bits to inactive in the Output Port Queue 

5 Memory. 



7 Reserved Op-codes 

8 Op-codes 17 to 31 are reserved for future use. 
9 

10 Principle of Operation 

1 1 The re-assembly block keeps track of a FLOW ID per port. The scheduler has a 

12 weighted round robin scheme that it uses to schedule a CELL for each port. If a port has 

13 more weight (or bandwidth) then it schedules more cells for that port. These scheduled 

14 cells are provided to the de-queue engine which sends the requests on to the Internal 

15 Memory Manager for pulling them from the SRAM. 

16 The Internal Memory Manager reads the scheduled cell from the external SRAM 

17 and stores it in an Output Data FIFO. Once the cell is stored in the Output Data FIFO the 

18 Internal Memory Manager queues the control information for that cell into the Output 

19 Control FIFO and asserts an available signal to the Reassembly Engine. 

20 The Reassembly Engine pops the Output Control FIFO to get the control 

21 information for the available cell. It uses the FLOW ID to look up the Header processing 

22 necessary for that cell as well as the Type field to determine the type of Reassembly to 

23 perform. 

24 It also obtains a pointer to the Internal Dual Port Data SRAM from the FBL FIFO. 

25 Once it has started the Header look up and obtained a Free Buffer Pointer the Reassembly 

26 Engine starts reading the data from the Internal Memory Manager Output Data FIFO. 

27 The data is read 128 bits at a time at 100 MHz. Each Data cell is 64 bytes long, with 

28 each data buffer being 64 bytes deep. There are 128 64-byte buffers in the Internal Data 

29 SRAM. The Header information will be stored in a separate Internal Dual Port SRAM 

30 that is also addressed from the FBL Pointer. The Header buffer has 128 16-byte buffers 

31 so it can handle up to 16 bytes per Header. The Header will be stored concurrently with 

32 the Cell Data being stored in the data buffer. 

33 Once the Header and Data are stored locally in memory the cell control 

34 information will be queued in the Output Control Memory and the Buffer pointer will be 

35 queued in the Output Port Queue Memory. If more than one cell is queued on a port then 

36 the Next Read Pointer will be stored in the Output Control Memory of the previous cell 

37 to link the current cell to the previous cell. 

38 The SPI has a weighted round robin scheme that it uses to Request a CELL for 

39 each port. If a port has more weight (or bandwidth) then it Requests more cells for that 

40 port. When the SPI requests a cell for a particular port the Reassembly checks the ports 

41 Output Port Queue to see if there are any cells available for that port. If there are no cells 
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1 available the Reassembly informs the SPI that the port has no data by asserting an Empty 

2 signal and the SPI continues on to the next port. If the port has cells available the 

3 Reassembly uses the Read pointer stored in the Output Port Queue to read the control 

4 information for that cell along with the associated Header and Cell Data. The 

5 Reassembly sends the Header and Cell Data to the SPI interface along with a valid signal 

6 and byte enables to specify the valid data bytes. The SPI will use the byte enables to strip 

7 any unwanted padding. After the Reassembly Engine sends the complete cell to the SPI 

8 interface it returns the Read Pointer to the FBL PTFO. 

9 The Reassembly engine has a Reassembly status table for each of the ports. It can 

10 simultaneously reassemble one packet at a time for each port. The local port Reassembly 

1 1 status table stores the control information for each port along with state information and 

12 or any partial cell information. The control information tells the re-assembly block how 

13 to re-assemble the FLOW for that port, whether it is ATM to ATM, ATM to packet, 

14 packet to ATM or packet to packet. 

15 For AAL5 to Packet reassembly a CRC- 32 is calculated on the packet as each 

16 cell is received. The partial CRC is stored on a per port basis and loaded back into the 

17 CRC engine for each subsequent cell in that packet. Once it has received the end of the 

18 packet the calculated CRC is check against the received CRC to ensure packet integrity. 

19 If the CRC is in error the packet will be marked as bad by setting the Bad control bit in 

20 the Output Queue FIFO for the SPI interface. 

21 For Layer 2 Packet reassembly the CRC-32 is checked on the packet as it is being 

22 sent to the SPI out interface. The CRC is calculated the same as the AAL5 CRC-32, but 

23 if there is an error detected it must be sent immediately to the SPI out as the complete 

24 packet has been sent in this case. The CRC checking is user selectable by programming 

25 the CRC enable bit in the Flow ID Lookup table. 

26 Statistics are kept on a per port basis for the number of Packets sent, the number 

27 of Bytes sent and the number of errors received. The CPU can read and write internal 

28 registers or statistics on a per port basis as well as access all internal memory locations 

29 and the external Header Lookup memory through the PCI interface. Data packets can 

30 also be sent to the CPU through the PCI interface. 
31 

32 Initialization Setup 

33 The initialization process consists of writing the Initialize Memories command 

34 op-code to the Conrmiand register. The Reassembly block will then initialize the Free 

35 Buffer List and clear all the Empty and Pending bits in the Output Port Queue Memory. 

36 When the initialization process is done, the command op-code will be cleared to all 

37 zeroes to inform the system CPU that the initialization process is complete. 
38 

39 Re-assembly flow 

40 1. Output scheduler selects the port. 

41 2. Priority scheduler for port decides on the FLOW ID. 

42 3. FLOW ID is sent to the De-queue engine. 

43 4. Output scheduler keeps the current head and other information per port. 

44 5. De-queue engine queues the buffer pointers along with corresponding port # 

45 for Internal Memory Manager. 
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1 6. Internal Memory Manager gives a data available signal to the reassembly 

2 engine when a complete cell is in the output data FIFO in the Internal Memory 

3 Manager. 

4 7. Internal Memory Manager sends ACK back to the scheduler 

5 8. Scheduler releases the buffer from the INTERNAL MEMORY MANAGER 

6 and passes next port and buffer head to the de-queue engine. 

7 9. Internal Memory Manager tells Reassembly that a cell is available. 

8 10. Reassembly engine pulls the control information for that cell from the Internal 

9 Memory Manager control FIFO and then extracts the data from the Internal 

10 Memory Manager output data FIFO. 

11 11. Reassembly uses the control information to look up the Header processing 

12 necessary for that FLOW. It also obtains a free buffer pointer to the Data 

13 SRAM and uses the control information to determine the type of processing to 

14 perform. 

15 12. Reassembly engine stores the new Header information along with the data cell 

16 and Queues the cell onto the appropriate output port. 

17 13. The SPIO requests data on a per port basis based on its own internal port 

18 calendar. 

19 14. Once the SPIO requests data for a port that has a cell queued on that port the 

20 Reassembly sends the control information for that cell along with the 

21 associated Header and Cell Data for transmission. 

22 15. The SPIO will strip extraneous pad data based on the byte enables that go with 

23 the data. 

24 16. The Reassembly engine sends CPU Buffer Full status information to the SPIO 

25 for combining with the port FIFO status information that is sent to the 

26 Scheduler from the SPIO. 
27 

28 



29 
30 
31 
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1 

2 Per Flow Queue Engine 

3 The Per flow queue stores flows parameters and statistics in dedicated FID memory. 

4 In addition, it stores memory buffer pointers or IDs that hold data traffic of the flow IDs 

5 in a dedicated BID memory. Furthermore, it will retrieve the above FID parameters, 

6 statistics and memory buffer IDs when they are needed. It manages the free buffers list. It 

7 performs Random early discards on FID data cells. 
8 

9 Features 

10 • Supports IM possible flows 

1 1 • Supports IK Tunnel Flows. 

12 • Supports IK Multicast Flows. 

13 • Multicast one flow to 1 K flows maximum. 

14 • Collects statistics on a per flow basis. 

15 o Packet Counters: 32 Bits. 

16 o Byte Counters: 48 Bits. 

17 o Packet Discards per external Requests such as SPI: 16 Bits 

18 o Packet Discards per Reassembly Timeout request: 16 bits. 

19 o Packet Discards per internal buffer management request: 16 bits. 

20 • Discards by returning buffer Ids to free buffer list. 

21 • Manages the Free Buffer List. 

22 • Manages RED by FID association. 

23 • RED Memory: IK x 48 bits. 

24 • FID Enqueue Memory: 2M x 72 bits. 

25 • FE) Dequeue Memory: IM x 36 bits. 

26 • FID Statistics Memory: 2M x 72 bits. 

27 • BIDLL Memory: 8M x 36 bits, including free buffer list. 

28 • Tunneling Dequeue Leaf Memory: IK x 56 bits, internal. 

29 • Tunneling Enqueue Leaf Memory: 2K x 82 bits, internal. 

30 • Tunneling Root Memory: 512x21 bits, internal. 

31 • Multicast Dequeue Leaf Memory: IK x 36 bits, internal. 

32 • Multicast Enqueue Leaf Memory: 2K x 82 bits, internal. 

33 • Multicast Root Memory: 512 x 30 bits, internal. 

34 • Clock Frequency: 200 MHz. 
35 

36 Functional Description 

37 See Figure 278. The Per Flow Queue Engine consists of the following blocks: 



38 Enqueue engine, Dequeue engine. Free List buffers manager, RED manager, Statistics 

39 engine, Timeout manager, and CIH Director. Enqueue engine places received data cell 

40 pointers on the queues. Dequeue engine retrieves and removes data cell pointers from the 

41 queues. The RED manager will track the total number of buffers on eight CLASS 

42 counters using each of the associated RED parameters. The cells will be dropped when 

43 the RED algorithms flags the cells as such. The statistics manager counts per flow 

44 various parameters. The timeout manager discards packets or cells when they exceed a 

45 certain timeout threshold on the enqueue side for the reassembly timeout. The timeout 
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1 manager discards packets or cells when they exceed a certain timeout threshold on the 

2 dequeue side for the reassembly timeout. The PFQ interfaces to the memory manager, to 

3 the shaper and to the scheduler. The Per Flow Queue Engine handles the flow tunneling, 

4 discarding and multicasting of FID in addition to enqueueing and dequeueing the FIDS. 

5 The statistics block will collect on a per flow basis the number of discarded flows, the 

6 number of bad flows, and the number of dropped flows because of RED. It will provide 

7 free buffers BIDS to the memory manager and it returns the buffer BIDS to the free 

8 buffer list. 
9 

10 Block Description 

1 1 Algorithms 

12 

1 3 Enqueue Normal FID Algorithm 



14 The enqueue request will place a BID of an FID on the queue. It will modify the 

15 BID as well as the FID Enqueue memory. 

16 When the queue is empty, the first enqueue is for an SOP cell. If the cell is not SOP, then 

17 the cells are dropped until the next SOP. See Figure 279. 

18 1 . The FID Enqueue memory is read, the tail pointer will point to the last BID in the 

19 link. 

20 2. The BED tail is compared to "Null"; if it is a null then the queue is empty. 

21 3. The current BID tail that was stored in the FID Enqueue memory BIDC will be 

22 used to access the BID memory. The new BID Tail is written as the content of 

23 that BIDC Tail location. If the cell is SOP or EOP then the flags are written in the 

24 BID memory with the BID link. 

25 4. The new BID replaces the tail pointer in FID Enqueue memory. 

26 5. The new tail is linked to "NULL" in the BID memory. 

27 • When the tail pointer is read, it will be checked to see if it is null. When it is null 

28 then the Dequeue memory has to be written with the new head and the associated 

29 flags such as the SOP, CLP, and EOP etc. 

30 • If an EOP is received, then a shape command is issued to the shaper. 

3 1 • The count is sent to the statistics block with the FID when the EOP flag is 

32 received. 

33 • The count is sent to the shaper and then it is not saved in the FID or BID 

34 memories. 

35 

36 Dequeue Normal FID Algorithm 

37 The de-queue request will remove a BID from an FID. 

38 1 . The FID Dequeue memory is read; the head BID pointer will point to the first 

39 BID in the link. 
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1 2. The current BID head that was stored in the FID Dequeue memory will be used to 

2 access the BID memory. 

3 3. The new BID will replace the head pointer and the value is written in the FID 

4 memory. If the BID has other flags like SOP, HOP they are written at the same 

5 time as the BID head. 

6 • The new BID is compared to Null. If it is equal to null, then the Queue is empty 

7 and the Enqueue memory for that FID has to have the tail as Null. 

8 • A dequeue command is sent to the memory manager. 

9 See Figure 280. 
10 

1 1 Enqueue Multicast FID Algorithm 

12 When the queue is empty, the first enqueue is for an SOP cell. If the cell is not 



13 SOP, then the cells are dropped until the next SOP. The bit in the control register will 

14 determine the treATMent of multicast traffic. If the bit is set to treat the multicast traffic 

15 as Unicast, then the traffic is enqueued, as Unicast one. Else, the traffic is treated as 

16 described in the following section. The multicast control bit in the control register is 

17 reset. The PFQ will discard all the enqueue requests from the memory manager when an 

18 EOP cell is received for a multicast traffic. When the SOP cell is received, it is enqueued 

19 as a Unicast cell. The buffer ID pointers are linked in the memory. The pointers in 

20 memory are enqueued only once. The count is read from the Multicast Root memory. 

21 Every BID will have the count of the previous BID. When an SOP cell is enqueued, the 

22 tail pointer is in the enqueue memory. There is no link in the BID memory to the tail 

23 pointer in the enqueue FID yet. When a NULL is written to next location, the count is 

24 properly written in the correct field. When the EOP cell is received, then the BID link is 

25 performed as it was done for the previous cells. When the BID of the EOP cell is linked, 

26 the multicast enqueue operations has to link the head and tail BIDS to the various FIDS 

27 that are members of the multicast group. At this time, the enqueue engine will discard all 

28 the cells with the enqueue request from the memory manager until the multicast packet 

29 has been linked to all the FIDS in the multicast group. The multicast engine will perform 

30 the rest of the operations to enqueue the packet on the rest of the FIDS. The multicast 

3 1 engine will read the next FID that in the multicast group. The first FID member has 

32 already the pointers linked in its location. The multicast engine will update the head and 

33 the tail pointers in the enqueue and dequeue memories locations that correspond to the 

34 FK) members if the FID queue is empty. If the FID is not empty, then the tail pointer is 

35 updated only. When the last FID member is update it, then the enqueue engine will 

36 continue to enqueue rather than discard the cells. The process is not efficient and the 

37 performance suffers considerably. 



38 The buffer management whether it is RED or class will be performed on the first SOP 

39 when the cell is enqueued for the HEAD member of the multicast group. Thereafter there 

40 are no checks done. After receiving the EOP cell, the DBS will get the FIDS that are 

41 members of the multicast group only. 

42 1 . The packet is enqueued as Unicast one until the EOP cell. 

43 2. When EOP cell is received, then the packet has to be linked for the rest of the FID 

44 members of the multicast group. 

45 3. The FID root memory is read to retrieve the head FID, tail FID pointers. 
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1 4. The head FID has the packet already Hnked on its queue. The next FID field in the 

2 multicast group is read. 

3 5. The next FID is compared to the tail FID to determine the last FID in the 

4 multicast group. 

5 6. The LEAF memory of the FID in the next FID field is read to determine if the 

6 queue is empty. If the queue of the FID is not empty, then the tail pointer is 

7 updated it with the last pointer of the HOP cell. The new tail pointer is linked to 

8 the old tail pointer in the BID memory and the COUNT is written there, too. 

9 7. If it is empty, then the tail pointer in the enqueue memory is updated it from the 

10 enqueue memory. The head pointer in the dequeue memory has to be updated it 

1 1 from the dequeue memory. The leaf memory is updated it with BID pointers. 

12 8. The next FID in the FID leaf memory is compared to the tail FID in the root 

13 memory and the above operation is repeated until the next FID and the tail FID 

14 are equal. Then it is the end of the multicast group. 

15 9. The PFQ_MEM_FULL signal is asserted to the memory manager. 

16 

17 Dequeue Multicast FID Algorithm 

18 The dequeue command for a multicast FID is similar to the Unicast FID. They BID 

19 link is read from the BID memory and it will be the new head pointer. In addition the 

20 counter is read from the BID location that is the new head pointer. The counter is 

21 decremented and then written back in the same location with that BID. If the counter 

22 value is zero, then the "REL" bit in the command to the memory manager is set. Else the 

23 "REL" bit is reset and there will be no release command for that BE) from the memory 

24 manager. If the BID link in the BID memory is "NULL", the queue for that FID is 

25 empty. The "NULL" has to be written back in that locations with the counter value equal 

26 to the previous count but decremented by 1 . 

27 1. The FID leaf memory is read. 

28 2. The location read is the FID that is part of the dequeue command. 

29 3. The head BID pointer in the FID memory will point to a location in the BID 

30 memory. 

31 4. The location contains the next BID head pointer and the count. 

32 5. The count is decremented by one and written into memory. 

33 6. The new BID will replace the head pointer and the value is written in the FK) leaf 

34 memory. 

35 7. The "REL" bit in the command is set when the count is zero. 

36 8. When the pointer is "NULL", it is handled the same way as in the Unicast FID. 
37 

38 Enqueue Tunneled FID Algorithm 

39 When the queue is empty, the first enqueue is for an SOP cell. If the cell is not SOP, 

40 then the cells are dropped until the next SOP. The enqueue of the tunneled FID is exactly 

41 as the normal FID. When an SOP cell is received, the tail and head pointers are updated it 

42 as usual. When the EOP cell is received, then the FID that the PFQ sends to the DBS is 

43 not the same FK) that was used to enqueue the packet. Instead, the FID will be read from 

44 the Leaf memory from a memory location pointed to by the FED of the enqueue 

45 command and it is in the ROOT Tunneling field. The FID that was read as the Tunnel 

46 ROOT memory represents one or more FIDS. The DBS will not know about the 
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1 members of the ROOT FID. The FID in order to be tunneled has to be a member of that 

2 specific tunnel. 

3 1. The leaf FID memory is read, the tail pointer will point. to the last BID in the link. 

4 The root memory association is read. 

5 2. The current BID tail that was stored in the FID memory will be used to access the 

6 BID memory. In that location the new BID is written. 

7 3. The new BID replaces the tail pointer and the FID location is written to the leaf 

8 FID. 

9 4. The ROOT memory corresponding location is read. 

10 5. If the not empty flag is reset, then the LEAF FID is written as the "CURRENT 

1 1 FID HEAD" and as the Tail FID 

12 6. In EOF cell cases, the ROOT FID is sent to the DBS with the cell count. 
13 

14 Dequeue Tunneled FID Algorithm 

15 The dequeue command that the DBS send to the PFQ contains an FID. The FID is the 

16 ROOT FID. The root memory has to be read and the member of the tunnel group has to 

17 be found. The Root memory has the head of the FID. The head is read and the FID leaf 

18 memory is read. After reading the Leaf memory, the dequeue operation proceeds the 

19 normal way. When the cell is EOF, then the "NEXT FID" field is used to update the new 

20 FK) head pointer. If the next FID queue is empty, then the next FID field in that FID 

21 entry will be used and so on. 

22 1 . The FID Root memory is read; the current FID pointer of the tunneled FID is 

23 retrieved. 

24 2. The FID leaf memory is read and the current BID head that was stored in the FID 

25 memory location will be used to access the BID memory. 

26 3. The new BID will replace the head pointer and the value is written in the FID leaf 

27 memory. 

28 4. The LEAF FID and the BID are sent to the memory manager. 

29 5. If the BID belongs to an EOF cell, then the "NEXT FID" field is read from the 

30 leaf memory. 

31 6. The previously read "NEXT FID" field will be used to update the "CURRENT 

32 FID TUNNEL HEAD" in the root memory. 

33 7. The new FID head leaf memory entry is read to determine if the queue is not 

34 empty. 

35 8. If the queue of the head FID is empty, then the "NEXT FID" field of that FID is 

36 read and the process is repeated until an FID without an empty queue is reached. 

37 9. During 

38 10. During the process, if the tail FID is reached, then there are no more BIDS on any 

39 queue. 

40 1 1. At that point the process terminates until the next dequeue command. The "NOT 

41 EMPTY" flag is reset. 

42 12. During the above process, assert "PFQ_ DBS_FULL" to the DBS. 
43 

44 Enqueue Engine 

45 See Figure 181. The enqueue engine gets its commands from the memory. The 

46 enqueue engine issues commands to the shaper. If there are no free buffers or the request 
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1 is to discard the FID, then the "NULL" FED is returned. If there is an enqueue request and 

2 there are no free buffers available, then the "NULL" BID is sent to the memory manager. 

3 If the RED determines that the packet has to be discarded, then the "NULL" BID is sent 

4 to the memory manager. 

5 The enqueue engine will enqueue the FID, discard the BID marked for discard, 

6 enqueue the multicast FID, and enqueue the Tunneling FID. The enqueue engine has to 

7 issue the FID and the BID memory commands always within the eight cycles window. 

8 The FED of the enqueue and de-queue commands will be compared and checked if they 

9 are the same, ff they are the same FID, then only the last FID memory write is valid and 

10 has the correct information. The Enqueue information will be merged with the dequeue 

1 1 information and written during the dequeue operation. The BID memory accesses depend 

12 on the FID results. The data coherency of the BED during multicast dequeue operations 

13 has to be maintained by the logic with respect to the BID content of that location and its 

14 count in case of multicast. 



15 The Tunneling of FIDS as well as the multicasting of FIDS is based on a circular 

16 buffer. Once the last FID is reached, then the enqueueing engine will point to the first 

17 FID in the link. Tunneling uses the root FID in the enqueue as well as in the dequeue 

18 process. The Shaper has no knowledge of the FIDS that are members of the Tunneled 

19 FID. The shaper has no knowledge of the multicast root FID. 

20 When the memory manager issues a command to the enqueue engine, the FID 

21 parameters are retrieved and the BID is enqueued. If the EOF is set, then a shape 

22 command is sent to the shaper. If the FID was empty, then the FID is activated. 

23 1 . The enqueue engine will read the following: 

24 • FID parameters from FID enqueue memory. 

25 • Free buffer head from the free buffer register. 

26 • RED counter. 

27 2. The enqueue engine will read the following: 

28 • Read the BID memory location FBH to get the new FBH. 

29 3. The corresponding statistics from statistics memory of the FID. 

30 4. The enqueue engine will perform the checking operations of the various 

31 conditions. The counters are incremented. 

32 5. The enqueue engine will write the following: 

33 • The BID memory location current BID Tail with BE) as content. 

34 • The BID memory location new BID tail with the "NULL" as content. 

35 • The "FID' ' enqueue location with the new parameters. 

36 • The "FID" dequeue location with the new tail if the queue is empty. 

37 • The FID statistics location with the new statistics. 

38 • The RED Counter with the new value. 

39 • The Free buffer register with the new head. 



40 

4 1 Dequeue Engine 

42 The Dequeue engine gets its commands from the scheduler. The Dequeue engine issues 

43 conrmiands to the memory manager and to the enqueue engine. When there is a dequeue 

44 command, the PFQ will get all the parameters from the FID and sends a conmiand to the 

45 memory manager for retrieval of data. Then the PFQ will update the buffers pointers 
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1 See Figure 282. 

2 1 . The dequeue engine will read the following: 

3 • FID parameters from the Dequeue FID memory. 

4 • The BID memory location BID head. 

5 2. The de-queue engine will perform the checking operations of the various 

6 conditions. The head pointers are updated. If the head is null then the queue is 

7 empty. 

8 3. The dequeue engine will write the following: 

9 • The "FID" dequeue location with the new parameters. 

10 • The "FID" enqueue location with the new tail "NULL" if empty. 
11 

12 Release BID 

13 The free buffer engine will return the BID to the free buffer list. It can be 

14 simultaneously issued with the enqueue request. The bit "REL" will determine if the 

15 memory manager will release the BID or not. 

16 • The free buffer BID memory with the new Hnk BID. 
17 

1 8 Free buffer manager 

19 There is a free buffers list. There is a corresponding head and tail registers on the 

20 device. There is a link list linked in the BID memory. The link list is initialized on init 

21 command. The enqueue will use the head of the free buffers. The tail of the free buffer 

22 list is updating it when the enqueue and dequeue engines are accessing it. The buffer 

23 number "7FFFFF' is reserved for not valid BID. The cell should be dropped when the 

24 memory manager gets this value in case of a discard request from the memory manger or 

25 the RED algorithm. See Figure 283. 

26 There are three parameters that are stored in registers for the free buffer list. 

27 • The number of available buffers. It represents the raw total storage memory space. 

28 • The threshold of no buffers. 

29 • The number of used buffers. 

30 The following operations are done on the counters: 

31 • Whenever a buffer is enqueued, then the number of used buffers is incremented. 

32 • When a buffer is released, then the number of used buffers is decremented. 

33 • The number of used buffers can never be less than "0". 

34 • The number of used buffers can never be more than the available buffers. 

35 When the number of used buffers is equal or larger to the threshold of number of buffers, 

36 then: 

37 • All the queue of that cell has to be emptied and all the buffers of that queue are 

38 returned to the free buffers list. The buffers in use counter will be decremented by 

39 the number of cells. 

40 • Update the free buffer register and memory link in the BID memory. 
41 

42 Timeout manager 

43 It is running with simultaneously with the enqueue operation. The timeout 

44 manager is used to remove orphan cells for reassembly timeout and free the storage for 

45 incoming traffic. When an FID corresponding memory location is read for an enqueue 
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1 operation, there are two TTL bits that are reset to "00" when the FID memory location is 

2 written back. The memories that are accessed are the external FID memory and the 

3 internal leaf fTD memories. The internal memory will be accessed once the external 

4 memory has been completely accessed once. The algorithm is not a fair one. When the 

5 timeout manager reads a location the TTL bits are changed from "00" to "01" or changed 

6 from "01" to "10". If the FID location is read and the TTL bits are "10", then there is a 

7 reassembly timeout and the packet remnants are discarded. The Qsize has to be 

8 decremented whenever there is a discard. It should have the value "0" in the statistics 

9 memory. 

10 The statistics memory location is read. Every 8 cycles or every time slot, one 

1 1 location of the enqueue well as the statistics memories is read to perform the timeout 

12 operations. If there is a reassembly timeout, then the start of timeout operation is 

13 suspended until the data is updated in both the enqueue and the statistics memories. 

14 The timeout manager will do the following: 

15 • An active address that points to an FID memory location. The address is 

16 sequential and circular. It is updated when the access is executed. 

17 • Every sync cycle the FID memory is accessed, a location corresponding to that 

18 address is read and examined. 

19 • If the TTL bits value is "00", then the value will be changed to "01". 

20 • If the TTL bits value is "01", then the value will be changed to "10". 

21 •If the TTL bits value is "10", then it is reassembly timeout and the packet chunk 

22 is discarded on the enqueue side. 

23 • The TTL bits value then are reset to "00"after the discard. 

24 • The DROP timeout bit is set. All the fragments of a packet has to be dropped 

25 until the next SOP. At the next SOP enqueueing starts again. 



26 

27 DISCARD FID 

28 The discard operation is running in place of the enqueue operation. Whenever 

29 there is a discard because of Reassembly timeout, the Qsize has to be decremented. It 

30 should have the value "0" in the statistics memory. There are several discard commands: 

31 • Memory manager generates a discard command of a packet that is being 

32 enqueued. 

33 • RED manager generates a discard command of a packet violating the algorithm. 

34 • Free Buffer manager generates a discard command of a packet being enqueued 

35 because of lack of free buffers. When the number of used buffers exceeds a 

36 programmed backpressure threshold register, the enqueue engine will signal the 

37 segmentation the above condition. If the number of used buffers drops 10 buffers 

38 below the programmed backpressure threshold, then the PFQ will signal the 

39 above message to the segmentation engine. If number of used buffers exceeds the 

40 progranmied threshold register then the enqueue engine will discard these buffers 

41 until the next EOP. The PFQ will discard the packet fragments from the queue in 

42 such a condition. 

43 • Timeout manager generates a discard command when the packet is not 

44 completely enqueued because the later chunks were not received or got dropped. 

45 The packet is a tail of the queue of that FID. 
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1 • CLASS discard; the packet will be discarded when the queue counter is above the 

2 programmed threshold. On every SOP the CLASS counters are checked. 

3 • Port Blockage; the packet will be dropped from SOP to EOP until the bit is 

4 changed in memory. On every SOP the port register is checked. 

5 When the memory manager generates the discard command, then the following steps are 

6 taken: 

7 1 . The FID enqueue memory is read. The previous packet tail is parsed. 

8 2. The BED memory is read and the BID of that location is read. The content is the 

9 head of the new packet being enqueued. 

10 3. The Free Buffer tail is read. 

1 1 4. The BID memory location of address FBT will have the content the head of the 

12 packet that is being discarded. The buffers are on the free buffers list. 

13 5. All the buffers are returned to the free buffer list. 

14 6. The FID memory location is updated with the new tail pointer, which is the tail of 

15 the FID. 

16 When the free buffer manager generates the discard command, then the following steps 

17 are taken: 

18 • The cell is dropped and the BID is returned to the free buffer list. There is no 

19 other operation required. 

20 When the timeout manager generates the discard command because of reassembly 

21 timeout, then the following steps are taken: 

22 1 . The FID memory is read. The previous packet tail is parsed. 

23 2. The BID memory is read and the BID of that location is read. The content is the 

24 head of the new packet being enqueued. 

25 3. The Free Buffer tail is read. 

26 4. The BID memory location of address FBT will have the content the head of the 

27 packet that is being discarded. The buffers are on the free buffers list. 

28 5. All the buffers are returned to the free buffer list. 

29 6. The FID memory location is updated with the new tail pointer. 

30 When the RED manager generates the discard command, then the following steps are 

31 taken: 

32 1. The cell is dropped. 
33 

34 Port Management 

35 On each SOP enqueue, the enqueue engine will check the port register to 

36 determine if the packet has to be dropped or not. The port number is available from the 

37 statistics memory. If the port register indicates that the cell has to be dropped, then the 

38 SOP cell is dropped. If the cell is not SOP, then the drop packet flag in the FID memory 

39 is checked. If the flag is set then the cell is dropped, else the enqueue operation continues. 

40 The port packet discard flag for that FID is set. The flag will be reset when the port 

41 register indicates that the port is re enabled again. 
42 

43 CLASS buffer Management 

44 Enabling the buffer management bit in the control register will enable either the 

45 RED algorithm or the CLASS management. If the RED bit is reset, in addition to the 

46 previously enabled bit, then the CLASS management algorithm is enabled. 
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1 The enqueue engine will increment the counter in the CLASS register corresponding to 

2 that FID on every cell. The Dequeue engine will decrement the counter on dequeue 

3 command. When enqueue an SOP cell, the counter of the corresponding class. 

4 On each SOP enqueue, the class counter will be compared to the threshold counter of that 

5 class. If the counter value exceeds the threshold, then the SOP cell is dropped. The cells 

6 of the rest of the packet are dropped until the next SOP cell. At that point the counter 

7 value is checked against the threshold and the process is repeated. 
8 

9 Random Early Discard Management 

10 Enabling the buffer management bit in the control register will enable either the 

1 1 RED algorithm or the CLASS management. If the RED bit is set in addition to the 

12 previously enabled bit, then the RED algorithm is enabled. Another way to bypass the 

13 RED is by making both thresholds equal and very high. The average queue will not get as 

14 high as the threshold. The drawback is the calculations are still needed. The queue size 

15 and the averages are calculated according to cell size. The Queue size is updated every 

16 time there is an EOP. It is incremented by the number of cells. The queue size is 

17 decremented every time there is a dequeue operation. It is decremented by one. 

18 When an SOP of an FID is received, the RED algorithm decides if the packet gets 

19 discarded. If the packet is discarded, then the SOP cell is dropped as well as subsequent 

20 arriving cells of the packet are dropped until the EOP cell is received. The BID is 

21 returned to the Free Buffer List. 



22 With constant average queue size the goal is to discard packets at regular intervals 

23 in order to avoid clustering of discards and global synchronization. It is not desirable to 

24 have too many discarded packets close together and also it is undesirable to have a long 

25 interval between discarded packets. 

26 The RED algorithm calculates the queue average size when an SOP cell is 

27 received and compares it to two thresholds. If the size is less than the minimum 

28 threshold, then the packet is not dropped. If the queue is larger than the maximum 

29 threshold then the packet is dropped. If the average size is between the minimum and the 

30 maximum thresholds, then a discard probability is calculated to determine the fate of the 

31 packet. There is a random number that will be used to determine if the packet has to be 

32 discarded in this case. 

33 The first step when a packet is received by an empty queue is to estimate the 

34 number of packets that would have been transmitted during idle time. Then the average 

35 queue size will be calculated using the original estimate as if the number of packets has 

36 been transmitted during that interval. The average queue size is calculated for every FED. 

37 There is 1 K parameter memory for the various attributes such as the minimum and the 

38 maximum thresholds and the weight of the queue. There is an association between the 

39 FIDS and the attributes in the parameter memory. 

40 The MAXth is decreased when the probability Pa of discard is increasing. When 

41 the probability changes slowly the queue average size changes slowly. This will 

42 discourage oscillation in the average queue size. The values of attributes MINth and 

43 MAXth determines the utilization of the bandwidth 

44 RED algorithm needs the following fixed and calculated parameters. 

45 The following variables are calculated and saved; 

46 • "AVG": The average queue size. 
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2 • "COUNT": The number of not discarded packets that have arrived since last 

3 discard, 

4 • "Q": It is the current queue size. It is incremented on enqueue with the number of 

5 cells and decremented with one on dequeue. 

6 The following parameters are fixed and programmable: 

7 • "Wq": It is the queue weight. An initial value of 0.002 is used. It should not be 

8 large in order to filter transient congestions. 

9 • "MINth ': It is the minimum threshold for queue. If the traffic is bur sty, then the 

10 value has to be fairly large. 

1 1 • "MAXth": It is the maximum threshold for queue. The value determines the 

12 maximum average delay in the device. An initial value of 3 * MINth is used. 

13 • "MAXp": It is the maximum value for a probability "Pb" of discard. An initial 

14 value of 1/50 is used. It should never be greater than 0.1. 

15 • "S": Typical transmission time for a small packet. 

16 • "RANDOM": Random number from generator. 

17 The following variables are calculated but not saved: 

18 • "Pa": It is the current packet discard probability. 

19 • "TIME": It is the current time. 

20 • "F (0": It is a linear function of time t. 

21 • "M": The number of packets that would have been transmitted when queue is 

22 empty. 

23 The following calculations are needed: 

24 • The number of packets "m" that might have been transmitted during idle time. 

25 • The average size of the queue. 

26 • The probability of discard when the queue size is between the minimum and the 

27 maximum thresholds. 
28 

29 Equations 

30 The queue empty size "M" equation is the following: 

31 M = (TIME - Q-™)/S 

32 The packet discard probability Pb equation is the following: 

33 Pb = MAXp (AVG - MINth)/(MAXth - MINth) 

34 It is desirable to have Pb a negative power of 2. 

35 The final packet discard probability Pa equation is the following: 

36 Pa = Pb/(l -COUNT *Pb) 

37 The average queue size is calculated according to the following: 

38 AVG = (1 - Wq) * AVG + Wq * Q 
39 

40 Implementation 

41 It is an easy determination when the average queue size is larger than MAXth and 

42 is smaller than MINth. The most complex part is when the average queue size falls 

43 between the MINth and the MAXth. When the queue average size is less than MINth, then 

44 the packet is transmitted. On the other hand when the queue average size is more than 

45 MAXth then the packet is discarded. If the queue average falls between the MINth and 
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1 MAXih then the probability determines the packet destiny. A pseudo random number 

2 "R" is generated by a uniform distribution random numbers generated. 

3 The packet is discarded if "R" satisfies the following equation is satisfied: 

4 R<Pb/(l-COUNT*Pb) 

5 If the average queue remains constant then the random number "R" is selected 

6 from the uniform distribution. The No, packet is discarded if the following equation is 

7 satisfied: 

8 R<PbN 
9 

10 Random Selection 

1 1 The random number is generated using linear feedback shift registers. The 

12 Ethernet polynomial will be used to generate a random number. The number is generated 

13 every time there is an enqueue operation. The following polynomial is used to generate 

14 the random number. 

15 G(X) = X'' + X'^ + X'' + X'' + X^SX'' + X*' + X'' + X' + X' + X' + XSX' + X'+1; 

16 The value of the shift register is used as the number generated. The number of 

17 nibbles that is stored in the attribute table is used to determine the random number is 

18 used. If the value stored is "FF\ then all eight nibbles are used. If the value is "7F', then 

19 the first 7 nibbles are used. See Figure 284. 
20 

2 1 Memory Size & Total Number of BIDS 

22 See Figure 285. 
23 

24 BID Memories Chip Select 

25 See Figure 286. 
26 

27 EXTRA MEMORY CYCLES 

28 The extra needed cycles of the FID memory are the following: 

29 • If the queue becomes empty after a dequeue operation the tail in the enqueue 

30 memory has to be written as "NULL". 

31 •If the queue is empty before an enqueue operation the head in the DEQUEUE 

32 memory has to be written with the same value as the tail pointer. 

33 • The Dequeue operation has to decrement the Qsize in the statistics memory. 

34 The BID memory requires an extra cycle after a tail is linked in memory. The extra 

35 cycle will link the tail to a "NULL". When the head reaches a "NULL'', then the queue is 

36 empty. 
37 

3 8 Memories Byte Enables 

39 The state machines have to drive the byte enables to the various memories. There 

40 are some write cycles during which only selected byte enables have to be asserted. The 

41 rest of the byte enables should not be asserted. 

42 The assertion of selected byte enables during write operations eliminates the need for 

43 a read operation. 

44 • Read Operations for all memories: The state machines will assert all the byte 

45 enables during all read operations. 

46 • Write Operations: 
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1 o Enqueue Memory: When the queue becomes empty after a dequeue 

2 operation, the tail pointer in the enqueue memory for that FID has to be 

3 written as NULL. When writing that location, the byte enable to bits 27 

4 downto 0 will only be asserted. 

5 o Dequeue Memory: When the queue is empty and the there is an enqueue 

6 operation, the head pointer in the de-queue memory for that FID has to be 

7 updated it. When writing that location, the byte enable to bits 36 downto 0 

8 will only be asserted because the TYPE comes with the FID. 

9 o BID Memory: When the NULL pointer is written at the end of an enqueue 

10 operation, the BID memory location has no relevant data. All byte enables 

1 1 will be asserted. 

12 o Statistics Memory: The dequeue engine has to decrement the Qsize of that 

13 FID after a dequeue operation. The STAT memory is read to get the value. 

14 Therefore, the all byte enables are asserted. 
15 

16 Global Synch and internal Time slot generation 

17 The global synch is provided for all the blocks. The block has to generate locally 

18 eight time slots. The memories controllers access these memories only during certain 

19 time slots. It is important that the all tasks are completed in one cycle of eight time slots 

20 or clocks. The tasks can take longer than eight cycles to complete. But the caveat is that a 

21 task has to start every eight cycles or time slots. The latency is longer but the bandwidth 

22 is the same. The scenario complicates the data dependency of all the concurrent tasks and 

23 sequential tasks. 
24 

25 External and Internal Memory Initialization 

26 On power up the memories are not initialized. There is an initialize signal that 



27 when is set by the PCI, it triggers state machine that will write a zero pattern in all 

28 memory locations of all memories except the BID memory portion of the free buffer lists. 

29 In the free buffer lists, the locations will be initialized to a value that reflects the previous 

30 BIDS. The first location of the free buffer list will have the value X"00000000". The 

31 second location will have the value X"00000001". The rest of the locations will have an 

32 incrementing values. These will be the free buffer values that are used for BIDS. The first 

33 BID "000000" is not used. It is the "NULL" pointer. 
34 

35 Data Dependencies and Coherency 



36 The data and the FID of both the enqueue and dequeue previous operation that has 

37 just finished have to be saved. The current FID has to be checked against the previous 

38 ones. If the current FID is the same as the previous one of either the enqueue and dequeue 

39 operations, then the data that was saved will be used. The data in memory is not the 

40 correct one. 

41 There are several scenarios where the data coherency will get compromised. 

42 The data coherency will get compromised with consecutive enqueue operations to same 

43 FID: 

44 • When the consecutive enqueue operations are for the same FID the statistics 

45 counters should be updated form internal registers. 



105 



AZA-001/2001-P00^^ ^ PATENT 

1 • When the consecutive enqueue operations are for the same FID the tail pointer 

2 should be updated from internal register. 

3 • When the consecutive enqueue operations are for the same FID the RED 

4 Calculations should be based on data in internal registers. 

5 • When the consecutive enqueue operations are for the same FID the Q Size should 

6 be incremented based on data in internal register, then written in the statistics 

7 memory. 

8 The data coherency will get compromised with consecutive dequeue operations to same 

9 FID: 

10 • When the consecutive dequeue operations are for the same FID the head pointer 

1 1 should be updated from internal register. 

12 • When the consecutive dequeue operations are for the same FID the Q Size should 

13 be decremented based on data in internal register then written in the statistics 

14 memory. 

15 The data coherency will get compromised with enqueue and dequeue operations to the 

16 same FID: 

17 • The only time that is compromised when there is an enqueue and a dequeue 

18 operation to the same FID. If the dequeue operation renders the queue empty and 

19 a "NULL" has to be written as the tail in enqueue memory and the head in the 

20 dequeue memory. Instead with the enqueue operation, the queue will not be 

21 empty any longer. The enqueue operation will preempt the dequeue operation and 

22 the new BID will be the tail and the head pointer. 

23 • The Qsize in the statistics memory has to be update correctly. The number of cells 

24 is decremented by one before it is added to the Qsize. 

25 The data coherency will get compromised with enqueue and timeout operations to the 

26 same FID: 

27 • The data written to the FID memory has to be the enqueue one. The TTL 

28 increment and the discard operation have to be bypassed. There will be no 

29 discards of the queue. The TTL will be written as "00". 

30 See Figure 287. 

31 Statistics 

32 The statistics memory is accessed when there is an enqueue, dequeue and CPU 

33 operations. All the requests are issued simultaneously. 

34 The enqueue will update the various counters depending on the conmiand issued by the 

35 memory manager. 

36 • Enqueue conmiand will increment by the number of cells the Q Size counter on 

37 EOP. 

38 • Enqueue command will increment the packet counter on EOP. 

39 • Enqueue conmiand will increment the byte size counter on command. 

40 • Discard command will increment the others discard counter. 

41 The dequeue command will decrement by one the Q Size counter always. 

42 The RED will increment the RED discard counter if the cell is dropped on EOP. 
43 
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1 NULL Pointer 

2 The value of '7FFFFF' is the "Null" pointer when the BID memories are all 

3 populated. 
4 

5 Shaper and Scheduler Commands flow 

6 When it receives an EOP of an FID, then the PFQ will send a shape conmiand to 

7 the shaper with the cell count of that packet. The Cell or packet indication has to have the 

8 correct value. 

9 In case of ATM cells, the PFQ will send to the shaper a command to shape the 
10 FID cell. 

11 

12 External Interface 

13 The external interface will be a standardized interface complying with the global 

14 SYNC. There are four interfaces between the PFQ and the external worid. External 

15 memories are part of the PFQ and the interface is considered internally for all practical 

16 purposes. 

17 • Enqueue a FID. The memory manger drives the request. The PFQ will respond 

18 with a BE) and a valid signal. 

19 o The memory manager to PFQ command is issued in CLK 6. 

20 o The PFQ to memory manger response is issued in CLK 6, next synch. 

21 • Discard a FID. 

22 o The memory manager to PFQ command is issued in CLK 6. 

23 • Release BID. 

24 o The memory manager to PFQ command is issued in CLK 6. 

25 • The second interface between the PFQ and memory manager is to dequeue an 

26 FID. The PFQ drives the dequeue request to the memory manager. 

27 o The PFQ to memory manager command is issued in CLK 6. 

28 • There is an interface between the PFQ and the shaper. The PFQ drives the request 

29 to the shaper. 

30 o The PFQ to shaper conmiands are issued in CLK 6. 

31 • There is an interface between the PFQ and the scheduler. The scheduler drives the 

32 requests to the PFQ. The request is to dequeue a BID of an FED. 

33 o The scheduler to PFQ command is issued in CLK 7. 
34 

35 Memory Structure 

36 

37 RED Attribute Memory 

38 This is internal memory. There are IK entries of this memory. The software writes 



39 it only. The address is the "RED_ASSOCIATION" from the FID memory. See Figures 

40 288 and 289. 
41 

42 Statistics Memory 

43 This is an external memory. There are 2 MEG locations. There are 2 locations per 

44 FID. The software writes only the port number. The rest is written by the logic. The CPU 

45 reads the counter values. The address is the "FID" for bits 20 downto Iwith bit 0 as "0" 

46 or"r'. 
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1 See Figures 290, 291 and 292.. 

2 All discard counters are updated on EOF. If the packet is being dropped because 

3 of RED and the EOF cell has a discard command instead of enqueue, then the RED 

4 counter is not update it. 
5 

6 FID Dequeue Memory 

7 This is an external memory. There are 1 Meg locations. There is one location per 

8 FID. The software does not write any location. The software under normal operating 

9 traffic conditions does not need any values from this memory. The address is the "FID". 
10 See Figures 293 and 294. 

11 

1 2 FID Enqueue Memory 

13 This is an external memory. There are 2 MEG locations. There are 2 locations per 

14 FID. The software writes only the RED association. The software under normal operating 

15 traffic conditions does not need any values from this memory. The address is the "FID" 

16 for bits 20 downto 1 with bit 0 as "0" or "1". See Figures 295, 296 and 297. 
17 

18 Buffer ID Data Structure 

19 This is an external memory. There are 8 MEG locations. There is one location per 

20 BID. The software writes the count only. The count cannot change when the queue is not 

21 empty. Hence there is no need for a separate byte write enable. The address is the "BID" 

22 tail or head that are read from the dequeue memory, enqueue memory or the free buffer 

23 register. See Figures 298 and 299. 

24 The SOP, EFCI, 0AM are set when the SOP cell arrives. The EOP is set in a 

25 different BID when it arrives. There is no need to write extra cycles on EOP. 
26 

27 Multicasting Data Structure 

28 These are internal memories. 

29 

30 Multicasting FID LEAF Data Structure 

31 

32 Multicasting FID Dequeue Memory 

33 There are 1 K locations. There is one location per FID. See Figures 300 and 301. 
34 

35 Multicasting FID Enqueue Memory 

36 There are 2 K locations. There are two locations per FID. There are 2 locations 

37 per FID. The software writes only the RED, and "NEXT_TIJN". The software under 

38 normal operating traffic conditions does not need any values from this memory. See 

39 Figures 302, 303 and 304. 



40 
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1 Multicasting FID ROOT Data Structure 



2 The enqueue engine uses the next table. There are 1 K locations. The software 

3 writes the head and the tail. See Figure 305. 
4 

5 Tunneling Data Structure 

6 The memories are internal. 



7 

8 
9 
10 

1 1 Tunneling FID LEAF Data Structure 



12 Tunneling FID Dequeue Memory 

13 There are 1 K locations. There is one location per FID. There is one location per 

14 FID. The software does not write any location except the "TUN_ROOT". The software 

15 under normal operating traffic conditions does not need any values from this memory. 

16 See Figure 306. 
17 

18 Tunneling FID Enqueue Memory 

19 There are 2 K locations. There are 2 locations per FID. There are 2 locations per 

20 FID. The software writes only the RED, and "NEXT_TUN". The software under normal 

21 operating traffic conditions does not need any values from this memory. See Figures 

22 307, 308 and 309. 

23 

24 Tunneling FID ROOT Data Structure 

25 The de-queue engine uses the next table. There are 1 K locations. The software 

26 writes the head and the tail. See Figure 310. 
27 

28 Interface 

29 The interface is either internally to the memory manager, shaper or the scheduler. 



30 Else, it is to the external rams. The BYTE enable for the enqueue memory should be 

31 connected that the "BYTE_ENABLE_0" is connected to the first three bytes that contain 

32 the BID tail pointer. The second "BYTE_ENABLE_0" should be connected to the rest of 

33 bytes. When writing the BID tail pointer with the value "NULL" by the dequeue engine, 

34 there is no need for a read operation, instead only the first three bytes are written. The 

35 rest of the data field is maintained as it was. The dequeue memory has no relevant data 

36 that need to be maintained when the enqueue engine updates the BID head pointer after 

37 empty queue. The type field is updated with the pointer. The 'TYPE" is part of the 



109 



AZA-OOl/2001-POO 




PATENT 



1 command. See Figure 3 1 1 . The signal "MEM_PFQ_FULL" is synchronized and bussed 

2 through to the DBS. 

3 

4 CPU Interface 

5 There are signals that are generated by the CPU interface and are used to control 

6 the memory sizes and the enable and disable signals. See Figure 312. 
7 

8 Segmentation Interface 

9 See Figure 313. The PFQ of an egress device sends a message to the segmentation 



10 engine of Ingress devices when the number of used buffers exceeds a programmed 

1 1 threshold. The data will contain the port address of the egress device. The segmentation 

12 engine will multicast the message to all the input devices that instructs these devices to 

13 refrain from sending any traffic to the device of the corresponding port. See Figure 314. 

14 The message indicates the condition when the number of buffers used exceeds the 

15 threshold value by setting the flag to 4'. There is another message that indicates when 

16 the condition no longer exists by setting the flag to "0". The rest of the data bits indicates 

17 the port number that is programmed by the CPU. 
18 

19 DBS & CBWFO Interface 



20 See Figure 315. 

21 DBS Interface 

22 See Figure 316. 
23 

24 Memory Manager Interface 

25 A command is stretched to few cycles as described below so that the number of 



26 pins is reduced. The interface between the 2 blocks is shown in Figures 317 and 318. 

27 The En-queue/Discard command from the memory manager to the PFQ takes 2 cycles to 

28 complete. The De-queue command from the PFQ to the memory manager takes 3 cycles 

29 to complete. When the BID is not valid, the "NULL" BID will be returned to the memory 

30 manager as the BID value. The memory manager will drop the cell whenever the BID is 

31 "NULL". See Figures 319 and 320. 
32 

33 MEM to PFQ 



34 The memory manager starts the command by asserting valid signal. The PFQ will 

35 always sample the command. If it is not NOP, then the command determines the number 

36 of valid cycles with data. If the command is a "discard" command, then the only valid 

37 parameters are the FID; all the rest of the parameters should be ignored by the PFQ. If the 

38 command is a release then the BID is important and the rest is ignored. If it is an enqueue 

39 command, then both Dl and D2 are valid. See Figure 321. 

40 The memory manager as well as the PFQ should have FIFOS to store the 

41 commands and the parameters. 
42 

43 PFQ to MEM 

44 The command takes 3 cycles to complete. In this command the data fields are 

45 transferred in 3 cycles as shown in Figure 322. 
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1 If "REL" is set, then the memory manger can reuse the buffer, which is the BID. 

2 As a result, the memory manger cannot send a command to PFQ releasing BID that was 

3 dequeued with the "REL" bit reset. The PFQ will not check the "DEQ CNT" in the BID 

4 memory. The "DEQ CNT" is used for multicasting traffic multiple times. 
5 

6 Interface Timing between the Memory Manger and PFQ 

7 The MXl and MX2 are not synchronous. The global synch signal is generated 

8 locally. As a result, the time slot is not adhered to in this interface. All the commands or 

9 data are stored in FIFOS after they reach the receiving device. The response to such a 

10 request will take between 16 and 24 clock cycles. For instance, the memory manger 

1 1 needs to wait between 16 and 24 clocks before it gets a BID after it asserts the FID 

12 enqueue command. See Figure 323. 
13 

14 Memory Manager Interface Timing 

15 See Figure 324. The clock is 100 MHZ and the commands from the PFQ to the 

16 memory manager are valid every 4 clocks. Similarly, the commands from the memory 

17 manager to the PFQ are valid every 4 clocks. As a result, the PFQ block and the memory 

18 manager block will receive a command every 8 clocks of 200 MHZ. 
19 

20 SEGMENTATION Interface Timing 

21 See Figure 325. The clock is 100 MHZ and the data is serially driven on two data 

22 pins. The flag indicates if the queue is full when it is "1". When the queue is full and the 

23 number of buffers used is equal or exceeds a programmable threshold, the PFQ will send 

24 a message indicating the condition by setting the flag to "1". When the number of buffers 

25 used is less than the programmed threshold by 10, then an equivalent message is sent but 

26 with the flag reset. 
27 

28 Software Interface 

29 The CPU interface doesn't use bursting. Data bus is still 32 bits. 
30 

31 Features 

32 • Data bus of 32 bits. There is data bus in and out. 

33 • Single address access (not a burst access). 

34 • CPU accesses all registers in all blocks via direct read/write commands. 

35 • All external memories and most of the internal memories are accessed via indirect 

36 accesses through registers 

37 • CPU accesses all internal/external memories in all blocks via indirect accesses using 

38 the general-purpose registers dedicated for these accesses. 
39 

40 Address space 

41 The address space is 64 entries only. The internal registers directly mapped to 

42 these addresses and external/internal memories are indirectly mapped. 

43 The address spaces per lookup engine is shown in Figure 326. 



44 
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1 Direct/Indirect Accesses 

2 A direct access is used to access. If there are internal memories or registers, which 

3 are longer then 32 bits, then read or write to these locations will be executed in more then 

4 one access. For example an entry in internal memory of 50 bits width (or a register with 

5 the same length) will be accessed in two CPU commands first access bits [31:0] and then 

6 [49:32]. Each register or small internal memories have their own address, which is unique 

7 to the entire chip set. 

8 An indirect access is used to access internal and external memories. Each block 

9 has a set of registers dedicated for the indirect access. One register contains the address 

10 and the command (Rd, Wr, Init etc). Few other registers contain the data for a write 

1 1 command. Writing to the COM/ ADD register triggers the execution of the command, 

12 once the block completed the execution of the command, the COM portion of the 

13 command is cleared by the block. When a read command occurs the data is written to the 

14 registers and then the CPU reads the registers to get the data. 
15 

16 Registers 



17 See Figure 327. 

18 

19 Commands 

20 NOP. 

21 See Figure 328. 

22 Read FID Enqueue Memory (72 bits) - Up to 2M locations. 

23 See Figure 329. 

24 Write FID Enqueue Memory (72 bits) - Up to 2M locations. 

25 See Figure 330. 

26 Read FID Dequeue Memory (36 bits)- Up to IM locations. 

27 See Figure 331. 

28 Write FID Dequeue Memory (36 bits) - Up to IM locations. 

29 See Figure 332. 

30 Read BID Memory (36 bits) - Up to 16M locations. 

31 See Figure 333. 
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1 Write BID Memory (36 bits) - Up to 16M locations. 

2 See Figure 334. 

3 Read Stat Memory (72 bits) - Up to 2M locations. 

4 See Figure 335. 

5 Write Stat Memory (72 bits) - Up to 2M locations. 

6 See Figure 336. 

7 Setup connection command for FID 

8 See Figure 337. This command sets up a connection by accessing external 

9 memories and initializes the corresponding locations to the proper parameters. It is a 
10 write command. Clear PFQ entries for ENQ, DEQ and STAT! 

11 

12 Tear-Down connection command for FID 

13 See Figure 338. This conunand tears down a connection by accessing external 

14 memories. It is a write command. The data is all "0" 

15 PFQ returns all remaining BIDS on the queue of the down connection to the free 

1 6 link list during this command. 

17 

1 8 INIT FID MEMORIES 

19 The memories are the enqueue, dequeue, and statistics memories. See Figure 339. 
20 

21 INIT BID Memory 

22 This will build the free buffer list. See Figure 340. 
23 

24 Reserved Op-codes 

25 The op-codes 1101, 1110, and 1111 are reserved. 
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1 

2 General commands 

3 The following commands are combinations of the above simple commands. A 

4 user command like this should be broken to several Read or Write commands in the 

5 driver. 
6 

7 Initialized External memories 

8 • Write "0" in all memory locations. 

9 • Write "NULL" in the enqueue memory. 

10 • Write "NULL" in the dequeue memory. 

1 1 • Build the free buffer list by linking each buffer to the next one. 

12 

13 Initialized Internal memories 

14 • Write "NULL" in the Multicast Root memory. 

1 5 • Write "NULL" in the Multicast Leaf memory. 

16 • Write "NULL" in the Tunneling Root memory. 

17 • Write "NULL" in the Tunneling Leaf memory. 
18 

19 Principle Of Operation 

20 The PFQ performs the following operations: 

21 • PFQ adds a BID to the FIDS queues. In essence it enqueues a BID. 

22 • PFQ removes a BID from the FIDS queues. In essence it dequeues a BID. 

23 • PFQ adds a BID to the free buffer queue. In essence it de allocates or returns a 

24 free BID. 

25 • PFQ removes a BID from the free buffer queue. In essence it dequeues or 

26 allocates a free BID. 

27 • PFQ increments the statistics counters. 

28 • PFQ timeouts the active FIDS. 

29 • PFQ asserts commands to the shaper and memory manager. 

30 • PFQ accepts commands from the scheduler and memory manager. 

3 1 • PFQ transfers statistics to the CPU. 

32 • PFQ executes early random discards depending on criteria that associate the FID 

33 with a counter and an algorithm. 

34 • PFQ executes discard operations. 

35 • The Memory manager issues the enqueue, discard and the release. 

36 • The scheduler issues the dequeue conmiand. 

37 The Per flow queue will get a command from the memory manager to enqueue an 

38 FID. The PFQ rejects or honors the request depending on the free buffers available or 

39 RED. The request is honored and the enqueue operation is performed. If the request is 

40 rejected, then a discard operation is performed by making the tail of the discarded packet 
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1 as the tail of the free buffer list. Also, the head of the packet will be linked to the previous 

2 free buffer tail. 

3 The scheduler commands are dequeue ones. The PFQ will remove the BID from the 

4 queue and transfer the information to the memory manager. 
5 

6 Error Conditions 

7 The total numbers of free buffers is compared to the number of buffers used and 

8 also to the total number of buffers on the queues. If the numbers do not match, then there 

9 will be an abort of all the operations. The PFQ will indicate the condition to the CPU. 

10 Another error condition happens when there is a dequeue request and the queue is empty. 
11 

12 TIMING DIAGRAMS 

13 The timing diagrams are for FED and BID memory accesses. They include the 

14 accesses of the FID memory as well as the BID memory for enqueueing, dequeueing, 

15 statistics collections and timeout scanning. See Figure 341. 

16 The memories cycles are summarized in Figures 342-346. 

17 When dequeues engine is updating the tail pointer in the enqueue memory to 

18 "NULL", there is one write operation that is performed to the enqueue memory location 

19 0. The BYTE ENABLE for the first 24 bits should be used. All the other byte enables 

20 should not be asserted. 

21 When the enqueue engine is updating the head pointer in the enqueue memory to 

22 "BID", there is one write operation that is performed to the dequeue memory. The BYTE 

23 ENABLE for the first 24 bits should be used. All the other byte enables should not be 

24 asserted. 
25 

26 Per Flow Commands 

27 • The Memory manager command to PFQ: 

28 o ENQ { FID } : Enqueue the FLOW. 

29 o DSCD { FID } : Discard the FE). 

30 o REL {BID}: It releases the BID. 

31 • The PFQ Commands to the memory manager: 

32 o SND { BE) } { REL } { PORT } : Get the data cell belonging to BID and send it 

33 with port number "PORT", afterwards release the buffer if "REL" is set. 

34 o VALID {BID } : Accept the BID for the previous enqueue command. 

35 • The PFQ Commands to DBS: 

36 o SHAPE_OR_SCHEDULE {FID} {BID} {LOCATION} {#CELLS } {EOP}: 

37 Shape the current Flow ID FID with the location "LOCATION", EOP marker 

38 received even though should be implied. 

39 • The DBS Commands to PFQ: 

40 o DEQ { FID } : The PFQ will pass the Buffer ID to the memory manager. If 

41 EOP is set, then send a shape command to shaper. 

42 • The PCI Commands to PFQ: 

43 o Get [ADR] { CNTS } : Get the counts values from the statistics memory. 



44 
45 
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1 Data Base 

2 The data base block is responsible for managing the packets link list per FlowID 

3 for FlowIDs in the Shaper and FlowIDs in the scheduler. The block receives an En-queue 

4 command from the PFQ and sends the FlowID either to the Shaper or to the Scheduler. A 

5 De-queue command starts with a port selection based on a port calendar and then a 

6 FlowID is selected between the scheduler and the shaper. 

7 Shaper traffic has priority over scheduler traffic. Shaper traffic is divided to 8 

8 classes of strict priority, once all shaper' s classes are empty for a given port, the 

9 scheduler is allowed to schedule. 
10 

11 Features 

12 1. This block operates at 200MHz clock. 

13 2. Support up to IM FlowIDs. 

14 3. Support a link list of packets/cells up to 8M cells. 

15 4. Performs the following in one port slot (8 cycles): 

16 a. One input phase from the PFQ and redirect to an input phase in the Shaper 

17 or in the Scheduler or to both (when traffic is just metered) 

18 b. One output phase of the Shaper. 

19 c. One output phase of the Scheduler. 

20 5. Support 8 strict priority classes for the shaped traffic per port. 

21 6. Performs port scheduling for the shaped traffic base of strict priority within the 

22 block. 
23 

24 Interface 

25 A global block diagram of the database block is shown in Figure 347. 

26 Figure 348 shows the interface to/from the data base block. See also Figure 349. 
27 

28 Functional Description 

29 The DBS block acts as the holder of the information per FlowID and the packets 

30 within the FlowID for the Shaper and the Scheduler blocks. The DBS block can be 

31 divided to several processes during one port slot: 
32 

33 PFQ Input Phase 

34 For a packet traffic the PFQ first assembles a complete packet and then transfer 

35 the packet to the DBS block, when in a cells traffic, a cell is being sent to the DBS each 

36 time it arrives. One cell in cells traffic is considered as a packet with a length of one cell. 

37 When a packet arrives to the DBS block from the PFQ, the DBS first links the packet to 

38 the FlowID and also generates an input phase to the Shaper or to the Scheduler or to both 

39 based on a parameter bit per FlowH). 
40 

4 1 Shaper Output Phase 

42 The output phase of the Shaper is being performed on a cell basis, means each cell 

43 of a packet generates an output phase from the Shaper. Once a cell is driven from the 

44 Shaper, the DBS block decrement the cell counter for the specific packet. If it's the last 

45 cell of the packet, the DBS links the packet to the shaper' s output memory per port. The 
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1 DBS also generates an empty indication to the Shaper if there are no more packets to 

2 shape for the FlowID. 
3 

4 Port Calendar 

5 A port calendar is a free running port selection based on a memory programmed 

6 by the user. Once a port is selected for the current output phase, the port number is 

7 transferred to both scheduler and shaper' s output memory. Both blocks select their output 

8 FlowID for the output stage. If the shaper' s output memory has a valid FlowID and the 

9 scheduler is not in the middle of a packet, then the FlowID to be driven to the PFQ is 

10 always from the shaper' s output memory. Shaped traffic has priority over un-shaped 

11 traffic. 

12 Each output phase that starts with the port calendar also starts with checking the 

13 port status. Only if the port calendar has a valid port and that port is not full according to 

14 the port status, a valid port is driven to the scheduler and to the shaper. 
15 

16 Port Status 

17 This block receives the status information from the output SPI interface for 64 

18 ports plus one CPU port. See the timing diagram of that interface below. 
19 

20 Scheduler Output Phase 

21 Once a port is selected, the scheduler performs it's own output phase. The DBS 

22 block needs to support this phase the same way it supports the shaper' s output phase. 
23 

24 Block Description 

25 The following are the algorithms to perform in each of the previous phases: 
26 

27 PFQ Output Phase 

28 1. The PFQ drives the following signals: VALID, FID, NCELL, BID, CLPl 

29 2. Read FID_mem[FID] { rp, wp, e, cell_cnt, ncell, dp, port, qos, c_p, 

30 CPU j)ort, shape} 

31 3. n_wp<=BID 

32 4. n_e<=0 

33 5. if(e==l) 

34 I) if (shape _meter==0 \ shape==I) perform input phase to Shaper: 

35 VALID, FID, qos, ncell 

36 2) if(shape_meter==0 \ shape==0) Perform input phase to Scheduler: 

37 VALID. FID, qos, CPU_port 

38 3) n_rp<=BID 

39 4) n_ncell <= NCELL 

40 5) n_cell_cnt <= NCELL 

41 6) n_clp<=CLPI 

42 7) pkt_next_we = 0 

43 6. else 

44 1) if (shape _meter"0 ) perform input phase to Shaper: 

45 VALID, FID, qos. ncell 

46 2) n_rp <- rp 
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1 3) n_ncell <= ncell 

2 4) njoelljcnt <-cell_cnt 

3 5) njclp <- dp 

4 6) pkt_next_we <= 1 

5 7. if[VALID=^l ) Write FID^mem[FID] <= 

6 {n_rp, n_wp, n_e, n^ncell n_cell_cnt, n_clp, qos, c_p, CPU_port, shape} 
1 8. if[VALID==l & pkt_next_we=^l) Write pkt_mem[wp] <= 

8 {ncelh dp, FID, next, [dbs_dpkt_word_en[l:0] ^11] {NCELL, CLPI, 

9 0, BID} 
10 

1 1 Shaper Output Phase 

12 7. The SHP drives the following signals: VALID, FID, VIOLATOR 

13 2. Read FID_mem[FID] {rp, wp, e, cell_cnt, n_cell, dp, port, qos, c_p, 

14 CPU _port, shape} 

15 3. Wait until read is complete 

16 4. curjeop <= cell_cnt==l 

17 5. cur_empty <= curjsop & rp~=wp 

18 6. Read pkt_mem[rp] {ncell, dp, next, FID} 

19 7. /?earf shp_out jort[poH] {pjrp, p_wp, p_e} 

20 8. if[cur_eop-=I ) n_rp<= next; n_cell_cnt <- ncell; n_n_cell <- ncell; 

21 9. else n_rp<= rp; njcell_cnt <= cell_cnt -1; n_njcell <= njcell; 

22 10. ificurjsmpty ) nje <- 1 

23 lLn_FID<^FID 

24 12. n_clp <= en_dp_change ? (VIOLATOR \ dp) : dp 

25 13. if[cur_eop) 

26 shpjout _port_we <= 1 

27 n _p_e <= 0 

28 n _p_wp <= 

29 ifip^e) nj)_rp <= rp 

30 e/^e 

31 shp_out _j?ort_we <=0 

32 i5. 5enrf cur_empty indication to Shaper 

33 76. if(VALID==l ) Write FID_mem[FID] <= 

34 /«_rP^ n_cell_cnt, n_n_cell, qos, c_p, CPU_port, shape} 

35 77. if(VALID==l & cur_eop) 

36 Write shpjout _port[port] <= {n_pjrp, n_p_wp, n_p_e } 

37 78. if(VALID==l & cur_eop & p_e==0) 

38 Write pkt_mem[p_wp][next][dbsjipkt_workjen[I] =!]<={ rp} 

39 79. if(VALID==l & cur_eop) 

40 Wnte pktjnern[rp][dbsJipkt_work_en[0] =1] <- { n_clp, njcell, n_FID} 
41 

42 DBS Output Phase (Combined with Scheduler Output Phase) 

43 7 . Read portjcalendar mem[count] { portid, valid, jump} 

44 2. if(jump=-l) count <= 0 

45 3. else count = count + 7 

46 4. send shaper and scheduler: portid, valid 
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1 5. wait for 4 cycles to receive SHP^FID, SHP_VALID, SCH_FID, SCH^VALID, 

2 SCHJdID 

3 6. get PFQ_FULL indication from PFQ block 

4 7. invalid & -PFQ_FULL & SHP_VALID & (SCHJdID \ SCH_VALID)) 

5 1) out Jrom_shp <= 1 

6 2) out Jromjsch <= 0 

7 ifivalid &-PFQ_FVLL & SCH_VALID & (SHPJ/ALID \ SCHJdID) 

8 1) outjrom_shp<=0 

9 2) out Jrom_sch <= i 

10 5. if[out Jrom_shp=-l) 

11 1) Read shp _j)riority_mem[portid] ( empty} 

12 2) Select class per portid based on empty : class _addr = {portid, class} 

13 3) Read shpjout_mem[class_addr] f p_rp, p_wr, pje} 

14 4) Read pkt_mem[p_rp] (ncell, dp, next, FID} 

15 5) Send to PFQ: FID, portid, pjclp, valid( =1) 

16 6) If(ncell==l) 

17 a. n _p_rp <= next 

18 b. if(pjrp--p_wp) n_p_e <= 7 

19 c. n_ncell <= 0 

20 7) else 

21 a. n _p_rp <- p^rp 

22 b, n _j>_e <= p_e 

23 c. njtcell <= ncell - 1 

24 8) Write shp_out_mem[class_addr] <= fnj_rp, p_wp, n_p_e} 

25 9) If(n j>_e^-l ) Write shp jriority [portid] <= new empty indication 

26 10) Write pkt_mem[p_rp] <= f n_ncell} 
21 9. if{ outJrom_sch ==1) 

28 1) Read FID_mem[SCH_FID] 

29 {rp, wp, e, cell_cnt, n_cell port, qos, c_p, CPU_port, shape} 

30 2) eop <- (cel_cnt==l) 

31 3) empty <= icell_cnt~-l & rp==wp) 

32 4) busy <= PFQ^FULL 

33 5) Readpkt_mem[rp]{ncell dp, next,FID} 

34 6) Send eop, busy, empty to scheduler 

35 7) if(eop) njrp <- next; n_cell_cnt <= ncell; n_n_cell<= ncell; 

36 8) else n_cell_cnt <= celljont-l;n_ njoell <= njoell; n_rp <- rp; 

37 9) ifiempty) n_e <= I 

38 10) Write FlDjnem[SCHJlD] < = 

39 fn_rp, wp, n_e, njcell_cnt, njcell, port, qos, cj), CPUjyort, shape} 
40 

41 Port Status 

42 This interface drives the status of the 64 ports in the output SPI and the CPU port. 

43 It is a continuous interface. Every 8 cycles of 200MHz the status indication of the 65 

44 ports is updated to a 65 bits register. The first cycle of the eight is marked v^ith a specific 

45 signal (sync signal). Figure 350 shows the interface. 
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1 The database block needs to synchronize the incoming status bits to the local 

2 200MHz clock and then use the status bits. A synchronization FIFO is being used, but 

3 since the write and read frequencies are the same, there is no empty or full indications. 

4 Reads will always lag 3 cycles after the writes. To achieve that, the read pointer is always 

5 the value of the write pointer minus 3. Also the first 3 reads after reset should be ignored. 
6 

7 Full indication from PFQ 

8 A full indication from the PFQ can be asserted any time. Once a full indication is 

9 asserted, the DBS first complete the current output phase and then freeze the increments 

10 of the port calendar. From that moment until this indication is de-asserted no output 

1 1 phase is performed from the scheduler or from the shaper's output FIFOs. The shaper's 

12 output phase to the DBS should still continue without any connection to the full 

13 indication. 
14 

15 CLP Bit 

16 The CLP bit is modified in the Database and then is sent back to the PFQ during 

17 the output stage. Only FlowIDs that are shaped can have their CLP bit changed, FlowIDs 

18 to the scheduler have the CLP bit unchanged. The violation indication from the shaper 

19 should set the CLP bit to 1, if no violation indication arrives then the CLP bit is to remain 

20 the same as it was. 
21 

22 Memory Description 

23 

24 Chip-Select indication for the PKT Memory 

25 The packet memory can support up to 8M locations (the address bus has 23 bits). 

26 Since there are no SSRAM memories in the market that have that amount of entries, the 

27 DBS provides chip-select signals so that several memories can be connected to get up to 

28 8M locations. 

29 The chip-select signals are used only when supporting 2M locations and up. The 

30 generation of the chip-select depends on the input from tjie CPU interface called 

31 CPU_CS_SEL[1:0] as follow: 

32 00 - generate chip-select based on address[20: 19] 

33 01 - generate chip-select based on address [2 1 : 20] 

34 10 - generate chip-select based on address[22:21] 

35 1 1 - Reserved 

36 for example, if supporting 2M locations only and the memory devices used have a depth 

37 of 512K locations, then CPU_CS_SEL bus should have the value of 00. the 

38 chip_select[3:0] will have the following meaning: 

39 chip_select_l[0] - selects memory block for addresses 0 - 1/2M 

40 chip_select_l[l] - selects memory block for addresses 1/2M - IM 

41 chip_select_l[2] - selects memory block for addresses IM - 1 1/2M 

42 chip_select_l[3] - selects memory block for addresses 1 1/2M - 2M 

43 In this case address bus bits [18:0] are going to all memory blocks (Note, each block has 

44 only 512K location, which translated to 19 bits), the chip_selectj bus controls 4 memory 

45 blocks, therefore, the total locations supported is 2M. 
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1 

2 DBS FlowID Memory 

3 

4 FlowID Memory 



5 Address: FID, Length: IM, #bits: 91 

6 See Figure 351. The user has to initiahze the above fields, which are marked as 

7 software using setup connection command. 

8 

9 PKT Memory 

10 Address: BID, length: 8M, #bits: 55 

1 1 See Figure 352. 
12 

13 Port calendar memory 

14 Address: counter, length: 192, #bits: 8 

15 See Figure 353. The user should initiate this memory prior to any traffic based on the 

16 users input. 
17 

18 Shp_strict memory 

19 Address: port, length:64, #bits: 12 

20 See Figure 354. 
21 

22 Shp_out_port memory 

23 Address: port, length:512, #bits: 49 

24 See Figure 355 

25 . 

26 Registers Description 

27 See Figure 356. 
28 

29 CPU Commands 

30 

3 1 Read external FID memory 

32 See Figure 357. 
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1 A[0] - C_P (bit [6] in the register) 

2 A[ 1 ] - CPU_PORT (bit [7] in the register) 

3 Pri[2:0] - Priority for shaped traffic 

4 S - Shape (bit [23] in the register) 

5 C[0] - dpi (bit [10] in the register) 

6 C[l] - Empty (bit [1 1] in the register) 

7 

8 Write external FID memory 

9 See Figure 358. 

10 A[0] - C_P (bit [6] in the register) 

11 A[ 1 ] - CPU_PORT (bit [6] in the register) 

12 Pri[2:0] - Priority for shaped traffic 

13 S - Shape (bit [23] in the register) 

14 C[0] - dpi (bit [ 10] in the register) 

15 C[l] - Empty (bit [1 1] in the register) 

16 

17 Setup FID connection 

18 See Figure 359. 

19 A[0] - C_P (bit [6] in the register) 

20 A[ 1 ] - CPU_PORT (bit [6] in the register) 

21 Pri[2:0] - Priority for shaped traffic 

22 S - Shape (bit [23] in the register) 

23 During the setup connection command, the user defines the above fields only. 

24 The fields RP, WP, NCELL, CELL_CNT and CLP should be reset to 0 by the hardware 

25 The field EMPTY should be set to 1 by the hardware. 

26 

27 Read external PKT memory 

28 See Figure 360. 

29 L[0] - CLP 

30 L[l] - Reserved 

31 

32 Write external PKT memory 

33 See Figure 361. 

34 L[0] - CLP 
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I L[l] - Reserved 
2 

3 Read port calendar memory 

4 See Figure 362. 

5 Data[5:0] - portid 

6 Data[6] - valid 

7 Data[7] -jump 

8 

9 Write port calendar memory 

10 See Figure 363. 

II Data[5:0] - portid 

12 Data[6] - valid 

13 Data[7]-jump 

14 

1 5 Read SHP Strict memory 

16 See Figure 364. 

17 P[2:0] - previous QOS pointer 

1 8 P[3] - previous QOS pointer valid 

19 

20 Write SHP Strict memory 

21 See Figure 365. 
22 

23 Read SHP out memory 

24 See Figure 366. 
25 

26 Write SHP out memory 

27 See Figure 367. 
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1 

2 Init SHP_out_port memory 

3 See Figure 368. This command generates write accesses to all entries in the 

4 Shp_out memory (64 entries) with a specific data of: p_rp = 0, p_wp =: 0, p_e = 1 

5 

6 Init Shp_strict memory 

7 See Figure 369. This command generates write accesses to all entries in the 

8 Shp_strict memory (64 entries) with a specific data of: empty = 8'bl 1 1 1_1 1 1 1, 

9 prev_qos_v = 0, prev__qos = 0 
10 

1 1 Reserved Opcodes 

12 The opcodes: 1 1 10, 1 1 1 1 are reserved. 
13 

14 Timing Diagram 

15 Output phase to PFQ from SHP output memory. See Figure 370. 

Notes for Figure 370: 
PFQ Output, SHP Output, SCH Output DBS Outmit 

Notes : 

l.Slot 4 has data dependency on slot 6 if SHP FID and PFQ FID are same. Use following 
algorithm for this data dependency 

if (shp_FID == PFQ_FID & shp_out_valid & PFQ Jn_valid( 
@slot 6 empty <= 0 

wp <= BID(PFQ) 

if (shp_empty) rp <= BID(PFQ) 

else rp <= n_rp 

2. Slot 4 has data dependency on slot 7 if scheduler FID is same PFQ FID. In that case 
if (sch_FID == PFQ^FID && sch_out_valid & PFQ_in_valid) 
©slot 7 

empty <= 0; 

wp <= BID(PFQ) 

if (sch.empty) rp <= BID(PFQ) 

else rp <= n_rp; 

There are no data dependencies on the pkt memory since a packet can exist only in one 
place at a given time. 

16 
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1 Interfaces 

2 

3 Shaper 

4 See Figure 371. 
5 

6 Scheduler 

7 See Figure 372. 
8 

9 PFQ 

10 See Figure 373. 

11 

1 2 Output Scheduler 

13 The output scheduler receives unshaped flow IDs from the Data Base Module(it is 

14 referred as DBS) and links them in its database. Then it chooses a flow ID based on its 

15 scheduling algorithm and outputs the flow ID to the PFQ engine. Note that unshaped 

16 flow IDs will get available bandwidth which is not used by shaped FLOW IDs. Shaped 

17 flow IDs are scheduled for output by the DBS and these flow IDs get committed 

18 bandwidth. In other words Shaped flow IDs from the DBS always get priority over 

19 unshaped flow IDs from scheduler unless it is middle of packet. When DBS received a 

20 flow ID, flow ID is either sent to shaper(if it to be shaped) or scheduler(if it is not to be 

21 shaped). Once flow ID is received from DBS, scheduler links this flow ID based on the 

22 port number and QOS. Ports are always serviced in round robin fashion by the database. 

23 DBS informs the scheduler the port number which will be served in this time slot( 8 

24 clocks). If there are one or more flow IDs pending for this port then a flow ID is selected 

25 for scheduling based on the following criteria. 

26 The scheduler implements 2 levels of scheduling 



27 • Level 2: Weighted-Round-Robin algorithm combined with strict priorities and 

28 best effort to schedule QOSs. 

29 • Level 3: Round Robin between FlowIDs 

30 The scheduler also interfaces with a CPU through the software interface. The CPU 



31 can read or write the scheduler internal registers and all the memories. Furthermore, the 

32 scheduler receives flow control status from the re-assembly engine, the PFQ engine and 

33 the egress chip. 
34 

35 Features 

36 1. The scheduler operates at 200MHz clock. 

37 2. Round-robin algorithm between Flows. 

38 3. Combination of weighted round-robin algorithm and strict priority for QOSs. 

39 4. Support up to 1048576 Flows (IM). 

40 5. Support 192 port of STSl rate each up to one OC192 port. CPU port is always 0C3. 
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1 6. Supports up to 1024 QOSs that are shared between the output ports. 

2 • Up to 8 QOSs per port. CPU port has only one QOS, which is not accounted in 

3 the IK. 

4 • The 8^ QOS per each port is acting as a best effort QOS, means this QOS will be 

5 selected only if all other QOSs are empty. 

6 7. Two modes of operations: 

7 • Egress Mode: (Normal operation) Each port has up to 8 QOSs, there are 64 

8 output-ports plus 1 CPU port which has only one QOS. 

9 • Ingress Mode: Support only one output port. All QOSs belong to that one port. 

10 Virtually there are still up to 256 ports with up to 8 QOSs dedicated to each. A 

1 1 separate port status information bits will arrive from the Egress chip to block the 

12 QOSs in the Ingress chip, which send traffic to the congested ports in the Egress 

13 chip. This mode of operation affects the port calendar only. 

14 Figure 374 shows both applications with respect to the switch: 

15 8. Status for Ingress chip applications comes from the Egress chip. 

16 9. External memory supported is pipelined ZBT memory. Micron provides pipelined 

17 ZBT memories of 512K x 32/36 or IM x 18. 

18 10. In case of scheduling packets, only one packet can exist in the scheduler for a specific 

19 FlowID. Once this packet is scheduled the database block (DBS) will issue another 

20 packet of the same FlowID to the scheduler if there is one linked in the DBS. In case 

21 of scheduling ATM cells, each cell is treated as a one cell packet. 

22 11. Accept to link a flow from the DBS once every 8 cycles of 200MHz. 

23 12. Since 8 clock cycles are needed to service a port (a cell), a total of 520 (64 ports + 

24 CPU port) cycles are needed to keep up the lOGbs traffic rate. 
25 

26 Interface 

27 Figure 375 shows the interface to/from the scheduler block. Figure 376 shows the 

28 detailed interface to other blocks. Signal Descriptions are found in Figure 377. 
29 

30 Functional description 

3 1 Flow ID is given to scheduler by DBS if flow ID given by PFQ is not to be 



32 shaped. Based on a port and a QOS a flow ED belongs to, the scheduler informs DBS to 

33 link up the flow ID in the DBS's linked list memory structure. DBS selects port to serve 

34 and inform scheduler port number which will be served in the given slot. Scheduler then 

35 selects a QOS based on the strict priorirty or WRR algorithm among all the QOSs of the 

36 port, and finally chooses a flow ID based on the round robin algorithm between all the 

37 flows of the QOS. The scheduler sends the scheduled flow ID to DBS for scheduling. At 

38 this point DBS decides whether flow ID from scheduler will be sent out or flow ID from 

39 shaper will be sent out. Always if there is flow ID from shaped traffic for this port, it will 

40 be sent out first. In that case DBS informs scheduler not to update any parameters as flow 

41 id from scheduler has not been scheduled. When there is no flow ID from shaped traffic 

42 for this port, then flow ID from scheduler will be sent out. In that case, scheduler updates 

43 all parameters. Also if the current cell scheduled is the last cell (in case of ATM traffic, 

44 Every cell sent out will be marked as eop ) then End Of Packet(eop) signal is sent from 

45 DBS to scheduler. Also if the current packet is the last packet linked for this flow ID, 



126 



AZA-00 1/2001 -POO, 




PATENT 



1 empty will be asserted by DBS to scheduler. If it eop and not empty then scheduler has to 

2 rotate flow IDs. Scheduler will link current read pointer(f_rp) to end of link(f_wp). This 

3 information is sent to DBS. Also next_FID given by DBS will become head of link list 

4 for this QOS. 

5 The following explains all the scheduling algorithms employed by the scheduler 

6 and the scheduler's linked list memory structure. 
7 

8 Round Robin Algorithm 

9 All the elements to be scheduled are linked together in one link list. There are two 

10 pointers to that link list, a head pointer and a tail pointer. A selection is basically 

1 1 forwarding the head pointer. There are 2 options to continue the algorithm after selecting 

12 the element: the element is linked to the end of the list and become the tail pointer while 

13 the next element becomes the head of the list. The second option is that the element is 

14 removed from the link list, in this case only the pointer is changed to the next element in 

15 the list. See Figure 378. 

16 A Round Robin algorithm is used for the FlowIDs. The port calendar is 

17 implementing the same but without removing or adding elements to the link list, means a 

18 fixed Hst. 



19 

20 Weighted-Round-Robin Algorithm 



21 This algorithm is an extension to the Round Robin one. Each element in the link 

22 list has a weight and will be selected as much as the weight allows. This algorithm should 

23 generate a relative bandwidth to each element according to the total elements linked: 

24 BWk = Weightk / IWeighti 



25 To implement this algorithm there is a need for two link lists. Active link list, which 

26 holds the active elements to schedule, once an element k was schedule weightk times, it is 

27 moved to the waiting list. Once all the elements in the active list exhausted their weight 

28 and moved to the waiting list (active list becomes empty) the waiting list becomes the 

29 active list and the above formula is performed on the elements in the new active list. 

30 During processing of the active list, if a new element arrives, it is linked to the waiting 

3 1 list and will participate the scheduling only after the current cycle of the above formula is 

32 done. Each time an element is moved to the waiting list, the weight count will be set to 

33 the original weight. See the example in Figure 379. This scheduling scheme is used to 

34 schedule the QOSs per port. 
35 

36 Smoothing the bursts 



37 The WRR algorithm can cause long bursts of cells for the same FlowID/QOS if 

38 the weight is high. In order to make smaller bursts of data for the same FlowID/QOS, the 

39 QOSs are rotating in the active list several times before exhausting their weight. A Factor 

40 parameter is introduced per port, the weight of all QOSs is divided by the Factor and the 

41 result is the number of times they will be scheduled before moving to the end of the 

42 active list. For an example see Figure 380. 
43 

44 Negative Weight 

45 When scheduling packets, the QOS of the specific FlowID will be scheduled until 

46 an EOP indication arrives, it is possible that the packet is bigger then the weight of the 
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1 QOS so that the count can become negative. Once EOF arrives if the weight count is still 

2 positive, the QOS can be scheduled for another packet. If the weight count is negative, 

3 the QOS is linked to the waiting list. The weight of the QOS is added to the weight count 

4 as well. Once he waiting list becomes active and the QOS is to be scheduled, if the 

5 weight count is still negative weight will be added again and the QOS will be sent to the 

6 waiting list again without scheduling. 

7 This is similar to a negative credit, if a QOS consumes all the credits and beyond 

8 it has to accumulate enough credits in order to start scheduling again. 

9 Best effort QOS 

10 This is another flavor for the selection of the QOS per port. A dedicated QOS 

1 1 (Q0S7) is called the best effort QOS. It does not participate in the strict priority scheme 

12 or in the weighted-round-robin scheme. This QOS is scheduled only if all QOSs that 

13 belong to the strict priority are empty and all the QOSs that belong to the WRR are empty 

14 as well. A simple way to describe it is that this QOS is scheduled only if all other QOSs 

15 are empty. 
16 

17 Strict Priority 

1 8 Select a QOS can be done either by WRR algorithm or by assigning priorities to 

19 QOSs. Each port can assign priorities and WRR QOSs as shown in Figure 381 . 
20 

21 Selecting QOS 

22 As shown in Figure 382 there will be two QOS queues maintained per port. One 

23 is active list and other is waiting list. When a flow ID arrives from DBS, it will always be 

24 put in waiting list. Since there are eight QOSs per port each of this list is 8-bit wide and 

25 they are called QA_EMPTY(active list) and QW_EMPTY(waiting list). When 

26 corresponding bit is 1 then this QOS is empty and nothing to schedule. When 

27 QA_EMPTY, is all ls(no QOS to schedule) QW_EMPTY is copied to QA_EMPTY, and 

28 corresponding bits QW_EMPTY are made 1 . Copying is done only for round robin 

29 priority QOSs. Strict priority QOSs will always remain in waiting list. At any point, 

30 when the port is to be serviced, based on the priority allocation for that port, first checked 

31 to find out if there are any strict priority QOSs are not empty in waiting list. If so then 

32 these QOSs are served first. Only when all strict priority QOSs are empty, then round 

33 robin QOSs are selected from active list. At any time if eop is sampled for the current 

34 QOS, this QOS will be served till eop irrespective of priority. Following algorithm 

35 explains how to select QOS for servicing once port is selected. 

36 if (prev_qos_v == valid) then current_qos = prev_qos 

37 else if (priority[0] & ~qw_empty[0]) current_qos = 0 

38 else if (priority[ 1] & ~qw_empty[l]) current_qos = 1 

39 else if (priority[2] & --qw_empty[2]) current_qos = 2 

40 else if (priority[3] & ~qw_empty[3]) current_qos = 3 

41 else if (priority[4] & ~qw_empty[4]) current_qos = 4 

42 else if (priority[5] & -qw_empty[5]) current_qos = 5 

43 else if (priority[6] & ~qw_empty[6]) current_qos = 6 

44 else if (priority[7] & ~qw_empty[7]) current_qos = 7 

45 else if (weight_mf !=0 & qa_empty[active_ptr] = 0) 

46 current_qos = active_ptr 
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1 else if (&qa_empty[7:0]=0) 

2 // repeat these steps for 0 to 7 

3 case (prev_qos) // synopsys paralleLcase 

4 000: 

5 caseClbO) 

6 qa_empty[l] : current_qos = 1 

7 qa_empty[2] : current_qos = 2 

8 qa_empty[3] : current_qos = 3 

9 qa_empty[4] : current_qos = 4 

10 qa_empty[5] : current_qos = 5 

1 1 qa_empty[6] : current_qos = 6 

12 qa_empty[7] : current_qos = 7 

13 qa_empty[0] : current_qos = 0 
14 

15 To load pending QOSs from waiting to active. This function is performed any 

16 time active list is empty. Note that only non_strict priority QOSs are loaded to active list 

17 and strict priority QOSs are always serviced from waiting list. To load non priority QOSs 

18 to active list use 

19 if(&qa_empty[7:0] = 1) 

20 qa_empty[0] = priority[0] | (--priorityLO] & qw_empty) 

21 qw_empty[0] = --prioirty[0] | (priority[0] & qw_empty[0]); 
22 

23 Levels of scheduling 

24 Once a port is selected according to the port calendar, a QOS is selected between 

25 all the QOSs that belong to the port. Once a QOS is selected a FlowID is selected 

26 between the FlowIDs that belong to that QOS. See Figure 382. 
27 

28 Weight Quota 

29 The weight of a QOS is calculated based on two values. The first value is the 

30 weight parameter, which is stored per QOS in the QOS descriptor memory and the 

31 second parameter is the weight_quota, which is a programmable vale per all the QOSs. 

32 The weight of a QOS is the multiplication of the weight and the weight_quota. 
33 

34 Input Algorithm 

35 1. Get FlowID and QOS from the shaper/DBS. 

36 2. Read QOS memory: 

37 • F_WP = QoS^mem[F^QOS][F_WP] 

38 • F_EMPTY = QoS_mem[F_QOS][F_EMPTY] 

39 • Q_NUM = QoS_mem[F_QOS][Q_NUM] 

40 • PORTJD = QoS_mem[F_QOS][PORT_ID] 

41 3. Read Port memory: 

42 • QA_EMPTY, QW^EMPTY = port_mem[PortJD][ QA_EMPTY, QW_EMPTY] 

43 • PRIORITY = port_mem[PortJD][PRIORITY] 

44 4. Write to memories: 

45 // Link FlowID to previous one 

46 • Write flow_mem[F_WP][F_NEXT] <= FlowID 

47 // Update tail of the link list of the FlowIDs 

48 • Write QoS_mem[F^QOS][F_EMPTY] <= 0 



129 



AZA-001/2001-P00^^ PATENT 

1 • Write QoS_mem[F__QOS]lF_ WP] FlowlD 

2 • If (F_EMPTY == 1) Write QoS_mem[F_QOS][F^RP] <= FlowlD 

3 // Marfc the QOS not empty in the port if needed 

4 If (QA_EMPTY[Q0S_NUM]==:=1)&(QW_Ef\/IPTY[Q0S_NUf^] == 1) 

5 • Write port^mem[PORT_ID][QW_Et^PTY[QOS^NUM]] <= 0 

6 // Link port to calendar in case of Ingress chip configuration 

7 If ({M0DE^=1) & ( IQA^EMPTY) & ( IQW^EMPTY)) begin 

8 Write port_mem[INGRESS_PTR[15:8]] PORTJD 

9 INGRESS_PTR[15:8] <= PORTJD 

10 lf(INGRESS_PTR[16]==0) INGRESS_PTR[7:0] <= PORTJD 

11 INGRES_PTR[16] = 0 

12 end 
13 

1 4 Output Algorithm 

15 The algorithm describes here doesn't show the conditions for selecting one of the 

16 options in step 9. 

17 ) . Get Port ID from Port Calendar. 

18 2. Read from port memory, 

19 • All parameters 

20 3. If (I (QOS^EMPTY)) 0) terminate output stage 

2 1 Else continue to step 4 

22 4. If active list is empty -> Move waiting list to Active list and clear waiting list. 

23 5. Select QOS according to priority, WRR and previous QOS in middle of packet indication, 

24 6. Read from QOS memory, address is {PortID, QOS}. 

25 • All parameters 

26 7. Send to PFQ the F_RP value (This is the scheduled FlowlD) 
21 8. Read FlowlD memory 

28 • All Parameters 

29 9. Get port status from Status block. 

30 10. Select one of the following options depending on parameters from all memories: 

31 o delink Jlowid 

32 o moveJlowidJo_end_of_active 

33 o dont_moveJlowid_and_update_celLcnt 

34 o do_nothingJoJlowid 

35 o delink_QoS 

36 o move_QoSjo_end_of_active 

37 o don't_move_QoS_and_update_weight_m 

38 o move_QoSjojA/aiting 

39 o do_nothingJo_QoS 

40 o For Ingress chip only: 

41 ■ move_portJo_end_ofJinkJist 

42 ■ delink_port 
43 

44 11. Update internal and external memories according to above decision. 
45 

46 Algorithm to calculate q_weight_m during output phase 

47 ifiqosjempty) 

48 q_weight_m <- 25 'hO 

49 else if(q_weight_m<0 & eop) //move to waiting list 

50 q_weight_m <= q_weight_m - 1 •¥ (q_weight * weight _quota) 

5 1 else if(q_weight_m!=25 'hlOOOOOO) // not the maximum negative value 

52 q_weight_m <= q_weight_m - 1 
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2 Memory links for Input Phase 

3 The input phase starts with a FlowID from the DBS. With this parameter all memories 

4 are accessed in sequence to get all relevant parameters required for the input phase. See 

5 Figure 383. 
6 

7 Memory links for Output Phase 

8 The output phase starts with a port assigned by the port calendar inside the DBS 

9 block. The port parameters are read from the internal memory, and a QOS is selected for 

10 that port based on the none-empty QOSs and the strict priority. The address to the QOS 

1 1 memory is {PortID, QOS_SELECTED}. The read pointer in the QOS parameters is the 

12 FlowID to send to the PFQ. Once reading the parameters of that FlowID it can be sent to 

13 the PFQ together with the EOP indication. See Figures 384 and 385. 
14 

15 CPU Port 

16 CPU port is different then the rest of the 64 ports supported. The CPU port does 

17 not have any QOS and it's rate is fixed to OC3. FlowlDs are linked to the CPU ports 

18 using the external NEXT memory as the next field. FlowIDs are scheduled based on a 

19 simple round robin between them. 
20 

21 Blocks Description 

22 The scheduler is divided to several different functional blocks as follow: 
23 

24 Input Phase Block 

25 The input phase contains a small FIFO to accumulate the link requests from the 

26 shaper. These link requests are served once an input slot is available. During this stage a 

27 FlowID is linked in a link list and if needed a QOS is linked to the waiting lists. Also the 

28 port associated with that FlowID is marked as non-empty. 
29 

30 Output Phase Block 

31 The output phase block receives a port from the port calendar. Based on the status 

32 of that port, a decision is made if to go ahead with scheduling (if the port is not full) or to 

33 dedicate the bandwidth to the software (if the port is full). If scheduling is ok, a QOS is 

34 selected according to a weighted-round-robin algorithm and based on that QOS a FlowID 

35 is selected. This FlowID is then transferred to the PFQ. 

36 If the cell count of a FlowID becomes zero, means a full packet was sent, the 

37 FlowID would be de-linked from the link list if there is no other packets linked to this 

38 FlowID. It is possible that because of that the QOS will be de-linked as well. 
39 

40 Software Interface 

41 Software needs to access the scheduler database for several purposes: 

42 1. Initialization after reset: port calendar and QOS databases as well as internal registers. 

43 2. When setting-up a connection, to write the FlowID parameters, 

44 3. For testing purposes of the internal memories mainly. 

45 All software accesses to the scheduler are done through specific registers during the 

46 dedicated time slots. 
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2 Memory Description 

3 The following is a description of the memories needed to implement the 

4 scheduling algorithm. 
5 

6 External Memory - Next FlowID 

7 Address: Flow ID, Length: IM (20 bits wide) 

8 See Figure 386. 
9 

10 Internal memory - QOS Parameters 

1 1 Address: QOS, Length: IK (17 bits wide) 

12 See Figure 387. 
13 

14 Internal memory - QOS Descriptors 

15 Address: QOS, Length: IK (66 bits wide) 

16 See Figure 388. 
17 

1 8 Internal memory - Port Parameters 

19 Address: Port ID, Length: 64 (21 bits wide) 

20 See Figure 389. 
21 

22 Internal memory - Port descriptor 

23 Address: Port ID, Length: 64 (47 bits) 

24 See Figure 390. 
25 

26 Internal memory - Next Port 

27 Address: Port ID, Length: 64 (8 bits) 

28 See Figure 391. 
29 

30 Software Interface 

3 1 The scheduler can be programmed via the software interface by a CPU. All the 

32 scheduler's control registers and memories can also be written and read for test purposes 

33 through the software interface. The following sections list all the control registers and all 

34 the scheduler software interface commands. 
35 

36 Registers 

37 See Figure 392. 
38 

39 Commands 



40 Read FID NEXT Memory 

41 See Figure 393. 
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Write FID NEXT Memory 


2 


See Figure 394. 


3 


Read QOS Parameters 


4 


See Figure 395. 


5 


Write QOS Parameters 


6 


See Figure 396. 


7 


Read QOS Descriptor 


8 


See Figure 397. 


9 


Write QOS Descriptor 


10 


See Figure 398. 


11 


Read PORT Parameters 


12 


See Figure 399. 


13 


Write PORT Parameters 


14 


See Figure 400. 


15 


Read PORT Descriptor 


16 


See Figure 401. 


17 


PQ[0] = PREV_QOS_VALID 


18 


PQ[3:1] = PREV_QOS[2:0] 


19 


PQ[6:4] = ACTIVE_PTR[2:0] 


20 




21 


Write PORT Descriptor 


22 


5>ee rigure 402. 


23 


Read PORT Next 


24 


See Figure 403. 
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1 Write PORT Next 

2 See Figure 404. 

3 Init QOS 

4 See Figure 405. 

5 This command write the parameters to the QOS Parameter Memory and also 

6 initiates a write to the QOS Descriptor Memory with the following values: 

7 F_EMPTY = 1 

8 Rest of the fields should be reset to zero 

9 Init port 

10 See Figure 406. 

11 This command write the parameters to the Port Parameter Memory and also 

12 initiates a write to the Port Descriptor Memory with the following values: 

13 QW_EMPTY[7:0] and QA_EMPTY[7:0] = 8'bl 1 1 1„1 1 1 1 

14 Rest of the fields are reset to zero. 

15 Reserved Op-codesReserved Op-codesReserved Op-codes 

16 The op-codes 1 1 1 1 are reserved. 
17 

18 Timing Diagrams 

19 

20 Global Timing for Input/Output Stages 

21 The input phase and the output phase should be combined during the 8 cycles 

22 budget per cell. Since we do so, there are data dependencies between input phase and 

23 output phase. Figure 407 shows the memory accesses needed for each memory for every 

24 8 cycles of input/output phase (Figure 408 shows the addresses, all data is 2 cycles later 

25 for the external memory): 
26 

27 Input/Output stage Pipes 

28 Input Stage 

29 The input stage actually starts one cycle before cycle 1 by sampling a valid 

30 signal from the DBS that holds for the entire input stage cycles. If there is no valid signal 

31 asserted, the input pipe is frozen by not changing the first pipe stage (this will also save 

32 power). See Figure 408. 

33 Output Stage 

34 When working as an egress chip, the output stage starts with reading the port 

35 memory parameters using a port ID sent by the port calendar inside the DBS. When in an 
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1 ingress chip mode the location to read from is determined by the INGRESS_PTR, which 

2 is a register controlled by the scheduler. 

3 The output stage diagram doesn't show the data dependencies to the input phase. 

4 Data dependencies will be discussed in the next section. See Figure 409. 
5 

6 Data Dependencies 

7 Data dependencies between the output stage and the input stage 

8 In the scheduler, the data dependencies might happen when the input stage and 

9 the output stage access to the same memory address. If one stage reads a memory 

10 location before the other finishes updating the same memory location, then the read result 

1 1 might be stale. Or if both stages write the same memory address, the later write might 

12 overwrite the previous write result. 

13 There is no data dependency issue for the FlowID memory since the input and 

14 output stages are mutually exclusive on a specific FlowID, i.e, the input and output stages 

15 never work on the same FlowED. If the input stage is linking a FlowBD into the linked list, 

16 the output stage is not aware of the existence of the FlowID yet so there is not output 

17 phase for this FlowID. If the output stage is scheduling a FlowID, the input stage can not 

18 link the same FlowID by the principle of operation. Even if by looking at the FID Next 

19 memory operations, there is no overlapping between the input stage and the output stage 

20 memory accesses. 

2 1 For the QOS memory accesses, the input stage only updates the field of F_WP at 

22 cycle 7. But the output stage may also update the F_WP field at cycle 6 of the next 8 

23 cycle slot. To resolve the data dependency issue, when the QOS address is the same for 

24 the input and the output stages, the input stage updates not only the memory field of 

25 F_WP at cycle 7 but also the register field of F_WP which stores the output stage read 

26 result of cycle3. 

27 By the same token in the port memory case, when the input and the output stages 

28 access to the same port memory address, the input stage updates the memory field of 

29 QW_EMPTY at cycle 7 and at the same time it has to update the same QW_EMPTY 

30 field of a register which stores the output stage read result at cycle 2 to avoid the data 

31 dependency. 

32 The output stage data dependencies between consecutive 8 cycle time slots 

33 The port memory accesses of the output stage take more that 8 cycles, which 

34 implies that the port memory accesses in one time slot overlaps the port memory accesses 

35 in the following time slot. More specifically, before the previous 8 cycle time slot updates 

36 the port memory at cycle 6, the current 8 cycle slot has to read the port memory at cycle 

37 5. 

38 To avoid the data dependencies when the consecutive 8 cycle time slots access to 

39 the same port (there is only one port in this case), the read result has to be taken from a 

40 register which stores the write date of the previous slot memory write instead of the 

4 1 memory content. 
42 
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1 Traffic Shaper/Meter 

2 This block can operate in 2 modes: 

3 • Traffic Shaper mode: in this mode the Shaper shapes specific rates to FlowIDs 

4 according to Single or Dual Leaky Bucket algorithm. The shaper selects a Peak rate 

5 or a Sustained rate per FlowID depending on an accumulated credit. Once a FlowK) 

6 has been shaped, it is marked as such in a common. 

7 • Traffic Meter mode: in this mode the Meter measures the rate of the incoming 

8 FlowK) traffic and mark the FlowID as a violator if the rate is above a specific 

9 threshold which is programmed for a FlowID. The user periodically read the 

10 indications in the FlowID memory. 

11 See Figure 410. 
12 

13 4.1 Features 

14 1. This block operates at 200MHz clock 

15 2. A FlowID can be issued from the Database at max once every 8 cycles of 

16 200MHz clock. 

17 3. A FlowID is driven to the Database at max once every 8 cycles of 200MHz clock. 

18 4. Shapes input traffic per FlowID using Single or Dual -Leaky-Bucket algorithm. 

19 5. Shapes traffic for rates from ~48.8kbps up to 9.920Gbps. 

20 6. Meter traffic from 12.8kbps up to 9.920Gbps in increments of 64kbps. 

21 7. Up to 1024 different programmable rates can be shaped simultaneously. Each 

22 shaped FlowID will point to one of the entries in this table for a specific shaping 

23 rate. 

24 8. Up to 1024 different programmable thresholds can be metered simultaneously. 

25 Each metered FlowID will point to one of the entries in this table for a specific 

26 threshold rate. 

27 9. Each FlowID is programmable to have Dual Leaky Bucket, Single Leaky Bucket 

28 or no shaping at all. 

29 10. Can shape or meter up to IM FlowIDs simultaneously. 

30 11. Supports microprocessor programming of the memories for FID, rates, etc. 

31 12. 28 bits of the external memory in FIDl are used for dual leaky bucket algorithm 

32 only. It is possible not to use these bits if using single leaky bucket only. Note that 

33 for a Meter function the extra memory for the dual-leaky-bucket is not needed. 

34 See Figure 41 1 and 412. Note that the memory bits, which are not used for the single 

35 leaky bucket should be tied to zero. 

36 See Figure 413. This block doesn't need all the memories for every single mode of 

37 operation. Refer to the memory description chapter to see the usage of the memories in 

38 each mode. 
39 

40 Functional Description 

41 The traffic shaper receives an input phase from the Database block along with the 

42 RatelD and then starts shaping the FlowED on a per cell basis. Each time a cell is shaped 

43 the Shaper generates a message to the Database. The Database then responds with an 

44 empty indication. All packets link list and output to the PFQ are done in the Database 

45 block. 
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1 

2 Leaky-Bucket Algorithm 

3 The basic leaky-bucket algorithm is an accumulation of credits for the given 

4 FlowID, once the amount of credit is enough for a cell, this cell is considered shaped and 

5 is transferred. The equation to calculate is: 

6 Rout = Rin*M/N 

7 When Rm is the input rate (based on the selected class), M input credit, N output 

8 credit. The following Figure shows an example of M/N = 2/5: 

9 See Figure 414 
10 

1 1 Dual-Leaky-Bucket Algorithm 

12 A dual-leaky-bucket contains 2 virtual buckets per FlowID (virtual because credit 



13 is accumulated only in one bucket). One bucket shapes the traffic for a sustained data rate 

14 and the other shapes the traffic for a peak data rate. 

15 The shaped traffic starts by using the Ms/Ns values of the sustained data rate. If for some 

16 reason the accumulated credits are above a specific threshold amount, then the shaped 

17 traffic uses the Mp/Np values of the peak data rate until the amount of credits returns to 

18 zero. 



19 The credit is calculated using the current time measurement. When shaping a cell, 

20 the difference between the last shaping time and the current shaping is time calculated to 

21 create AT this is the elapsed time from the last shaped cell. ACredit is calculated as 

22 follow: 

23 ACredit = AT * Ms- Ns 

24 New.Credit = 01d_Credit + ACredit 

25 If AT is long enough then a positive credit is accumulated. If AT is short then a 

26 negative credit is accumulated. Once New_Credit is above the threshold then the shaping 

27 is done with the peak rate while the credit is always calculated with the sustain data rate, 
28 

29 Implementing Timing Wheel for leaky-bucket algorithm 

30 The timing wheel contains 256K time slots. The wheel is rotating in a rate of one 



31 time slot every cell cycle. Each time slot can have few FlowIDs link to it. There is one 

32 exit point in the timing wheel, which is time zero. A FlowID is linked to a future time slot 

33 based on its shaping parameters. Once that time slot reached time zero, all the FlowIDs 

34 that are linked to that specific time slot are dropped to an output FIFO. When a FlowID 

35 enters the output FIFO it means that one cell for that specific FlowID is shaped. 

36 Once every cell cycle one FlowID is taken from the output FIFO and is being serviced, 

37 means, a new time slot is calculated for the FlowID based on the ratelD and some other 

38 parameters. The FlowID then is linked to that future time slot. In parallel the shaper sends 

39 this FlowID to the Database block, in this block a cell count and packets link list are 

40 maintained per FlowID. 

41 It is possible that several FlowIDs are linked to the same time slot. When this time 

42 slot arrives to time zero, several FlowIDs are scheduled to the output FIFO at the same 

43 cell cycle. In this case the wheel services only one FlowID when the rest are waiting in an 

44 output FIFO to be serviced in the next cell cycle. See Figure 415. 
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1 The timing wheel is actually fixed and only the zero reference point is moving. 

2 That means the wheel can be implemented as a memory and the zero point is a moving 

3 address, which is incremented once every cell cycle. 

4 A single leaky bucket algorithm is implemented using the timing wheel. As described 

5 above, the Equation to implement the Leaky Bucket is: 

6 Rout = Rin*M/N 

7 When assigning K to N/M, we will get 

8 Rout = Rin / K 

9 The K parameter is the time in the future to place the FlowE). For example if K=3 

10 means the Rout is 1/3 of Ru, then the FlowE) is linked to the time slot which it's distance 

1 1 from the current time is 3. Since the timing wheel is rotating once every cell cycle (once 

12 every 8 cycles) the next time the FlowID will be seen is after 3 cell cycles when the time 

13 slot will reach zero: 

14 See Figure 416. 
15 

16 Fractional value of K 

17 It is possible that K will have a fractional value and an integer value for example 

18 to get 40% of the bandwidth K should have the value of 2.5. It is impossible to have a 

19 fractional time slot, therefore, to place a FlowID in a future time slot only the integer part 

20 of K is used. The fractional part of K is getting accumulated and is added to the integer 

21 part only if it is greater or equal to 1 . For example, if K=l .28 (78. 125% of the line rate) 

22 the shaping will appear as in Figure 417. 
23 

24 Output FIFO in the timing wheel 

25 The output FIFO is a link list of the FlowDDs that are shaped for one cell and are 

26 waiting to be re-linked to the wheel to some future timing slot. The output FIFO is used 

27 because it is possible that more than one FlowID will be linked to a specific slot. In that 

28 case, when the slot arrives to time 0, several FlowIDs are shaped at the same time. Since 

29 the Shaper can handle only one FlowID every cell cycle, the rest of the FlowIDs have to 

30 wait. 

31 Each time a new list of FlowIDs are linked to the output FIFO, they are marked 

32 with the time arrival to the output FIFO. When calculating the future time, the result is 

33 decreased by the amount of time the FlowID was in the output FIFO. For example, 2 

34 FlowIDs are linked in the output FIFO, Kfidi=3 and Kfid2=5. The scheduled time slots 

35 will appear as in Figure 418. 
36 

37 Dual-Leaky-Bucket using timing wheel 

38 To create a dual-leaky-bucket in timing wheel there is a need for the following 

39 parameters: 

40 • Ks for sustain data rate. 

41 • Kp for the peak data rate. 

42 • Ms, Mp Sustain and Peak input rates. 

43 • Threshold the threshold value of the accumulated credits to switch to peak rate. 

44 • Credit the total amount of positive credits (always a positive number). 

45 • Residue the accumulation of the fractional part of Ks or Kp. 

46 • TimeA the time of arrival to the output FIFO. 
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1 • TimcL time of last scheduled cell to a time slot. 

2 • Back Ground Process is used to mark timer wrapping around since measuring AT. 

3 There are 3 processes for a FlowID: input phase, output phase and a background process: 

4 • Input phase is when the FlowID is first linked to the timing wheel, same calculation 

5 as the output phase is applied except the residue in this case is 0. 

6 • Output phase is when the FlowID is services in the output FIFO. 

7 • Background process is a round robin service between all FlowEDs (IM of them) to 

8 mark the wrapping around of the current time. This is done for a FlowID that 

9 becomes empty and after a while becomes non empty, the credits should be calculated 

10 properly for the FlowID. 

11 When changing the sustain rate to a peak rate then the peak rate is used until the 

12 amount of credits is down to zero. The calculation of ACredit is being done using the 

13 sustain data rate always. Figure 419 describes the use of sustain and peak rates. 

14 

1 5 Calculation of NCredit 

16 The calculation of the ACredit depends of the rates and the time elapsed from the 

17 last output phase. There is a small difference between the calculations in the output or the 

18 input phases. 

19 K is a pure number. K multiple by one time tick gives the amount of time ticks 

20 needed to shape one cell. The equation is as follow: 

2 1 ACredit = (AT - 1 TimeTick * K) / K = (AT - K) / K 

22 AT is the time elapsed from the last output phase. K is decremented from AT since 

23 during that time a cell was shaped. The division by K allows measuring the credits in 

24 terms of cells. Note that the K in the division is the pure number (which is N/M) when 

25 the K inside the parenthesis has units of time. 

26 • When the output phase uses the sustain or peak rates the equation to be used is: 

27 ACredit = (AT - Ks) / = AT/Ks - 1 

28 • When the input phase calculates the change in the credit there is no cell shaped 

29 therefore: 

30 ACredit = AT/Ks 

3 1 Equations (Algorithm) for Input phase 

32 7. NewCredit^O 

33 2. TimeSlot = INT(KJ // integer part ofK^ 

34 3. NewResidue = FRAQKs ) //fractional part ofKs 

35 Equations (Algorithm) for Output phase 

36 7. Ar = CurrentTime-Timei 

37 2. Calculate ACredit 

38 5. TempCredit = Credit + ACredit 

39 4. If[TempCredit>Threshold) 
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1 NewCredit = Threshold 

2 Select K = Kp 

3 Else 

4 NewCredit = TempCredit 

5 Select K^Ks 

6 5. NewTime^(INT(K-^ Residue) 'TimeA 

7 6. TimeSlot = if[NewTime<0) ? 0 else NewTime 

8 7. NewResidue = FRAQK-^-Residue) 

9 8. NewMarkBits = 11 

10 Equations (Algorithm) for Background phase 

11 L if[MarkBits==ll) NewMarkBits = 10 

12 2. if(MarkBits==10) NewMarkBits = 0 
13 

14 Rin and K factor 

15 During input phase or output phase a future time slot is calculated according to K 

16 and time variables. A slow rate, means a high K value, will cause a linking of the FlowID 

17 to a far timing slot. Whereas a high rate, means a low K value, will cause a linking of a 

18 FlowID to a closer timing slot. This chapter is to describe the exact Rin for the calculation 

19 of: 

20 Rout = Rin / K 

21 The chip runs at 200MHz and the data path is 64 bits, therefore, 

22 Rn = 200 MHz * 64bits = 12.8 Gbps 

23 There are 256K timing slots in the timing wheel, therefore, the slowest rate that it 

24 can process is: 

25 Rout(inin) = 12.8G/256K = 48.828125 kbps 

26 In order to have a shaped rate of 64kbps, K should be 200,000: 

27 64kbps = 12.8G/200000 

28 That means that if a FlowID is placed in the 200000^*" time slot in the future each 

29 time, its output rate is exactly 64kbps. Figure 420 shows some of the standard rates with 

30 their corresponding K. Note that the ratio between OC192 (9.92G) to 12.8G is 1.290 
31 

32 Violator indication 

33 When using a dual-leaky-bucket algorithm the violator indication should be set 

34 when using the peak rate. When using a single-leaky-bucket algorithm, the violator 

35 indication should not be set. 
36 

37 Metering 

38 Metering is the other mode of operation of this block. The user can assign one of 

39 IK different rates pre defined by the user to a specific FlowID and the block will monitor 

40 the traffic to that HowID. If the actual traffic rate of the FlowID, the measured rate, is 

41 greater then the assigned rate then the FlowID is marked by the hardware. 

42 The user should periodically read the marking bit per active FlowID. Once the user reads 

43 the FlowID information, the bit is cleared and remains clear until the input traffic is 

44 violating again. 
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2 External memory for Metering mode 

3 The meter block used the same memory bits of the FlowID but for different 

4 meaning. Each FlowED needs to have a counter to count number of incoming cells. Also 

5 each FlowDD needs to have a bit to mark the FlowID as a violator of the assigned rate or 

6 not. 
7 

8 Background process of Metering 

9 The background process of the metering is used to check for the accumulated 

10 amount of cells per FlowID every specific amount of time. Once every 8 cycles the 

1 1 background process services one FlowID. That means, a FlowID is being checked by the 

12 background process once every 40mSec (at 200MHz clock). 

13 To provide more accuracy to lower rates, a counter is added to each FlowID to 



14 control the amount of times the FlowID is accessed by the background process before the 

15 actual measurement and marking is done. When the background process accesses a 

16 FlowID and this counter is greater then zero, then the background process only decrement 

17 that counter, when the counter is zero the background process compares the accumulated 

18 amount of cells to the threshold, mark the FlowID is the threshold is smaller then the 

19 accumulated amount and then the cell counter is reset, the background counter is set to 

20 the value assign by the user. 

21 Once the metering algorithm marked a FlowID as a violator, this FlowID is to 

22 remain marked until the user reads the FlowID memory. When the user reads the FlowK) 

23 memory, the mark bit is reset by the hardware. 
24 

25 Output FIFO for Metering 

26 During the input phase of a FlowID in the metering mode, the FlowID can be 

27 marked as a violator. If it is to be marked as a violator, then the FlowID is linked to a link 

28 list of violators. The software accesses this list when it wants to get the list of the 

29 violators, this way the software doesn't have to read all FlowIDs to detect the violators. 

30 Once the FlowE) is linked to that list, only when the software reads it, it will be removed 

31 from the list. The software first read the value of the MARK_RP register, which is the 

32 pointer to the head of the violators link list and then read the FlowID memory. The Meter 

33 generates a write to the FlowID memory to clear the mark bit, 
34 

35 Block Description 

36 Shaper's Input Phase 

37 The Shaper input phase receives a ratelD, FlowID and valid signal from the 

38 DataBase block. This is a request from the database block to shape the FlowID. Once the 

39 FlowID is inside the shaper, no more input phase is initiated by the DBS block. 

40 If the specific time slot was empty then it is marked as non-empty, if the specific time 

41 slot was not empty then the FlowID is linked to the last FlowID in the link list. 
42 

43 Shaper's Slot Background process 

44 The Traffic Shaper Wheel is constantly rotating one time slot every 8 clocks of 

45 the 200MHz clock. As each time slot becomes current, the background process is 

46 activated to get al the FlowIDs from the current time slot and link them to the output 
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1 FIFO link list. While linking the FlowIDs from the time slot to the output FIFO, the time 

2 slot is marked as empty and become the last time position in the wheel. 
3 

4 Shaper's FlowID Background process 

5 This background process is to mark the time in each FlowID such that in the next 

6 output phase a AT is calculated correctly even if the free running time counter was 

7 wrapped around. The process resets the bits while the output phase sets the bits. Each 

8 FlowID is serviced once every IM cell cycles (8M clock cycles). In order to mark a 

9 FlowID to use only the threshold value as a credit without using AT, this process has to 

10 be repeated 2 times. Therefore, a FlowID is marked as "too old to calculate credit" at 

1 1 least after lM+1 cell cycles and no more then 2M cell cycles (16M clock cycles). In this 

12 case the credit value becomes the value of the threshold. 
13 

14 Shaper's Output Phase 

15 The output phase operates once every cell cycle on the FlowBD, which is the head 

16 of the output FIFO's link list. Operating on a specific FlowID means shaping one cell of 

17 the FlowID. The shaper drives the FlowID, which is the head of the output FIFO's link 

18 list to the database block. The database block manages the link list of the packets pr 

19 FlowID. The act of driving a FlowID to the database block means that one cell is shaped 

20 for that FlowK). 

21 The database can return a message to the shaper that the current FlowID is empty, 

22 in this case the shaper will not link the FlowID to a future time slot. Note that the Shaper 

23 always shapes a cell at a time for packets traffic and for cells traffic. It is the Database 

24 block that keeps track of shaped packets/cells and generate de-queue commands to the 

25 PFQ accordingly. During metering mode, the output phase is disabled. 
26 

27 Shaper's Time measurement 

28 For the dual-leaky-bucket algorithm there is a need to measure time for 

29 calculating the change in the credit. A 21 bits register is used to count the time, this 

30 counter is incremented once every 8 cycles. For all calculations the logic used only 18 

3 1 bits of the time counter except for the FlowK) background process. 

32 Since there are IM FlowIDs supported there is a need to measure time up to IM 

33 units and more, that's is why the time counter has 21 bits. 
34 

35 Traffic Shaper Algorithms 

36 The below algorithms are implementation specific and not only the mathematical 

37 equations and conditions. To understand the algorithms, one should look at the memory 

38 description first. Memory description. 
39 

40 Shaper Input Phase Algorithm (for single leaky bucket) 

41 Receive from database block: VALID, FID, RATEID 

42 2. RD shp_rateJd_mem[RATEID]{Ks, Kp, Threshold, Mj 

43 J. RD FID2_mem[FID]f start, bg_stt, timejnsb} 

44 RD FIDI_mem[FID]{lastJime, residue, credit, p_s} 

45 4. // Calculate future time slot and residue based on previous values 

46 z/r (p_s==l ) & bg_stt[I:0]!=2 WO ) 
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1 a) slot_id[17:0] <= cur _slot_addr[ 17:0] + INT(Kp) 

2 b) n_residue[17:0J <= FRAQKp) 

3 c) ifiVALlD & INT(Kp)==0) link_to_outJifo <= 1 else link_to_outJifo <= 

4 0 

5 else 

6 a) slotjd[17:0] <= cur_slot_addr[17:0] + /iVrrA:5) 

7 Z?) n_residue[17:0] <=FRAC(Ks) 

8 c) iflVALID & INT(Ks)==0) link_to_out_fifo <= i e/se linkjojoutjifo <= 

9 0 

10 5. RDslot_mem[slot_id]{rp_slot,wp_slot,e_slot} 

11 6. // Calculate new credit, see block diagram below 

12 n_credit = f(current_time, lastjime, Ks, M, credit, threshold, bg_stt) 

13 7. // Calculate rate selector for next output phase 

14 if(n_credit>=threshold) n_p_s <= i 

15 else if(n_credit==0) n_p_s <= 0 

16 n _p_s <= 

17 5. // Creare /mit FID to future time slot 

18 9. if(e_slot==l) 

19 aj (n_rp_slot, n_wp_slot, n_e_slot) <- FID, FID, 0 

20 b) n_start<=l 

21 10. else 

22 a) {n_rp_slot, n_wp_slot, n_e_slot} <= rp_slot, FID, 0 

23 b) n_start <= 0 

24 11. ifilink_to_out Jifo) 

25 Link FID to output FIFO 

26 12. else if(VALID==I) 

27 WR slot_mem[slotjd]{rp_slot, wp_slot, e_slot}<= n_rp_slot, n_wp_slot, 

28 n_e_slot 

29 i 3. // Write back for FID fields 

30 if{VALID==l) 

31 FIDl_word_en[0],Wr FIDl_mem[FID]{residue} <= n_residue 

32 FIDI_word_en[3], Wr FIDl_mem[FID]{ credit, p_sj <= njcredit, n_p_s 

33 FID2_word_en[0J,WrFID2_mem[FID]{start} <=n_start 

34 if(VALID==l & e_slot==0) //to avoid a write to an unknown FID's next field 

35 FID2_word_en[I], WR FID2_mem[wp_slot]{next_FID, bg_stt, 

36 time_msb} <- FID, 11, cur_time_reg[20] 

37 Slot background Phase Algorithm 

38 1. current _slot <= current jslot + 1; //wraps around 256k 

39 2. RD shp_slot_mem[current_slot]{rp_slot, wp_slot, ejslot} 

40 i. WRshp_slot_mem[current_slot]{e_slot} <= I'bl 

41 If(e_slot==l) 

42 a. c?o nothing //nothing to add to the output registers 

43 5. e/xe 

44 a. wp_out_reg <= wpjslot 

45 Z?. wr_en_shp_FID_next_mem <= (e_out_reg==0) 

46 c. if(e_out_reg==0) //output fifo is not empty 
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1 rpjoutjreg <= rpjout_reg 

2 //. e_outjreg <= e_out_reg 

3 d else 

4 rpj}utjreg <= rp_slot 

5 n. ejoutjreg <= 0 

6 6. //■ (wr_en_shp_FID_next_mem ==1) 

7 FID2_word_en[0l WR shp_FID_next_mem[wp_out_reg]{next_FID} <= f77_5/c?r 

8 7. If[e^slot==0) 

9 FIDl_word__en[ll WR FIDl_mem[rp_slot]{ arrival Jime} <= current Jime 
10 

1 1 FID Background Phase Algorithm (for dual-leaky-bucket only) 

12 7. current J'lD <= currentJFID + 7; 

13 2. /2D FID2jnem[current_FID]f empty, bg_stat, time_msb, start} 

14 5. case (bg_stat) //synopsys full case parallel case 

15 a. 2'bll:n_bg_stat<=2'bl0 

16 ft. 2'blO:n_bg_stat<=2WO 

17 c. default: rbOO 

18 endcase 

19 4. FlD2_wordjen[l], WR FID2_mem[current_FID]{empty, bg_stat, time_msb, start j< 

20 n_empty, nj)g_stat, time^msb, start 

2 1 Shaper Output Phase Algorithm 

22 i. /fD FIDl _mem[rp_put_reg]{ residue y arrivaljime, last Jime, credit, p_sj 

23 2. /?D FID2_mem[rp_out_reg]fnext_FID, bg_stt, time_msb, start} 

24 5. 5end/ to Database block: rp_out (as 0_FID), --ejoutjreg (as 0_VALID) 

25 4. Receive RatelD from Database 

26 5. /fD rateid_mem[RateID]{Ks, Kp, threshold} 

21 6. // Calculate future time slot and residue based on previous values 

28 time_spent_inJifo[ 1 7:0 ] <- current Jime f 1 7:0] - arrival Jime[ 1 7:0] 

29 if[pj==l) 

30 a) slotjd[18:0] <= cur_slot_addr[17:0] + INT(Kp) - 

31 time_spentjn Jifo[17:0] 

32 njesidue[17:0] <=FRAC(Kp) 

33 ifiVALID & INT(Kp)==0) linkjo_out Jifo <= 1 else linkJo_outJifo < = 

34 0 

35 else 

36 slotjd[18:0] <= cur_slot_addr[17:0] + WrfAT^j - 

37 time_spentjn Jifo[17:0] 

38 ^) njesidue[17:0] <=FRAC(Ks) 

39 a. if(VALID & INT(Ks)==0) linkJoj)ut Jifo <= 7 else linkjo_outSifo <= 

40 0 

41 7, // Calculate new credit 

42 a. njredit =f( current Jime, last Jime, residue, Ks, Kp, M, credit, threshold) 

43 8. // Calculate rate selector for next output phase 

44 if(n_credit>=threshold) n j)_s <- 1 

45 else if(njredit==0) n j?_s <=0 

46 n J? J <- p_s 
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1 9. // prepare values for FID link to a slot 

2 RD slot_mem[slotjd]{rp_slot, wp_slot, e_slotj 

3 if(e__slot==l) 

4 a. {njrp_slot, n_wpjslot, n_e_slot} <= FID, FID, 0 

5 b, n_start <=1 

6 c. next_wr_en <= 0 
1 else 

8 a. {n_rp_slot, n_wp_slot, n_e_slot} <- rp^slot, FID, 0 

9 b, n_start <=0 

10 c. next_wr_en <= 1 

1 1 10, Receive EMPTY indication from Database 

12 IL// Link FID to future time slot 

13 if(0_VALID==l & -EMPTY & linkjojoutputjifo) 

14 link FID to output FIFO 

15 else if(0_VALID==l & -EMPTY ) 

16 WR slot_mem[slotJd]frp_slot, wp_slot, e_slot}<- n_rp_slot, n_wp_slot, 

17 n_e_slot 

18 12,// Write back all FID variables 

19 if(0_V/iLID & -EMPTY) 

20 WR FlDljnem[rpj)ut_reg]{ residue} < = njresidue 

21 WR FlDl_mem[rp_out_reg](last_time} <= currentjimejreg 

22 WR FIDljnem[rp_out_reg]{credit, p_s} <= n_credit, n_p_s 

23 WR FID2_mem[rp_out_reg]{bg_stt, timejnsb, start} <= 11, cur_time_reg[20], 

24 n_start 

25 if(next_wr__en"l ) WR FID2_mem[wp_slot]{FID_next} <= rp_out_reg 

26 13. if(rp_out_reg~wp_out_reg & don 7 place FID in output fifo again) e_out_reg 

27 <=i 

28 else e_out_reg <= 0 

29 14. rpjjutjreg <~ next_FID 
30 

3 1 Translate {Exp, Mantissa} presentation of K to Integer and Fraction 

32 The K factors are represented in the database with mantissa and exponent. 

33 • For Ks and Kp the presentation is { 1. mantissa} x 2^""^. exp means the number of 

34 places to shift the decimal point to the right. 

35 • For inverse of Ks the presentation is {O.mantissa} x 2'^''^. exp means the number of 

36 places to shift the decimal point to the left. 

37 • Exp can be common for the Ks value and the InvKS value in the internal memory. 

38 The following algorithm is to translate Ks or Kp value to integer and fraction values from 

39 a presentation of {exp, mantissa}. This code is used in the input phase and in the output 

40 phase, ratejnt = INT(K) and rate_frac = FRAC(K): 

41 7. case(exp) // synopsys full case parallel case 

42 0: ratejnt<={17'b0,l} rate Jrac<=man( 17:0) 

43 1: rateJnt<=fl6W,l,man(17)} ratejrac<=fman(16:0),l 'bO} 

44 2: rateJnt<={15W,l,man(17:16)}rateJrac<={man(15:0),2W} 

45 3:rateJnt<=firbO,l,man(17:15)}rateJrac<={man(14:0),3W} 

46 4: rateJnt<^{13W, l,man(l 7:14)}ratejrac< =fman(13:0),4 W} 
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1 5: rate_int<={12'b0,l,man{l 7:13)} ratejrac< =fman(12:0).5'b0} 

2 6: rate_int<={irb0,l.man(17:12)}ratejrac<={man(ll:0),6'b0} 

3 7: ratejnt<={10'b0,l.man(17:ll)}ratejrac<={man(10:0),7'b0j 

4 8: ratejnt<=f9'b0,l,man(17:10)j rate Jrac<={man(9:0).8'b0} 

5 9: ratejnt<={8'b0.1,man(17:9)} ratejrac<={man(8:0),9'b0} 

6 10: rate_int<={7'b0,l.man(17:8)} ratejrac<={man(7:0),10'b0} 

7 11: ratejnt<={6'b0,l,man(17:7)} ratejrac<={man(6:0),ll'b0} 

8 12: rate_int<={5'b0,l,man(17:6)} ratejrac<={man(5:0),12'b0} 

9 13: rate_int<={4'b0,l,man(17:5)} ratejrac<={man(4:0),13'b0} 

10 14: ratejnt<={3'b0.1,man{17:4)} ratejrac<={man(3:0).14'b0} 

11 75; rate_int<={2'b0,l,man(17:3)} rateJrac<={man(2:0).15W} 

12 16: ratejnt<=frb0,l.man(17:2)j ratejrac<={man(l:0).16'b0} 

13 default: rate_int<={l,man(17:l)l ratejrac<={man(0),17'b0j 

14 // make case 1 7 and above default to keep, for exp>=17 use 17 only 

15 endcase 

16 2. INT(K)<=rateJnt-l FRAC(K) = rateJrac 

17 Calculation of K based on desired Rout 

1 8 Since the shaper performs the following equation: 

19 Rout = Rin/K 

20 K value can easily be calculated as: 

21 K = Riii/Rout 

22 For rates units of [bit/Sec] K is calculated liice this: 

23 K=12.8*10'/Ro„t 

24 Credit calculation diagram 

25 For input phase the credit calculation depends on the following parameters: 



26 njoredit =f(current_time, lastjime, Ks, M, credit, threshold). 

27 For output phase the credit calculation depends on the following parameters: 

28 njcredit = f(current_time, lastjime, residue, Ks, Kp. M, credit, threshold). 

29 Both calculations are almost the same, the implementation should be the same for both 

30 and only some inputs may change if using it for input phase or output phase. 

3 1 Output phase calculation is shown in Figure 42 1 . 
32 

33 Meter Algorithms 

34 

35 Input Algorithm 



36 1. Receive FID, RATEID, VALID, NCELL from database block 

37 2. RDrateid_mem[RATEID] {threshold} 

38 3. RD FIDl_mem[FID] {cell_cnt, mark, bg_cnt, bg_cnt_value} 

39 4. n_cell_cnt_tmp[27:0] = cell_cnt[26:0] + NCELL[15:0] 

40 5. if(n_cell_cnt_tmp[27]==l) n_cell_cnt <= 27'h7ffffff 

41 else n_cell_cnt <= n_cell_cnt_tmp[26:0] 

42 6. n_mark <= mark \ (n_cell_cntjmp[27:0] >{1 'bO, threshold[26:0] }) 

43 7. n_bg_cnt <= bg_cnt 

44 8. if(mark=0 & n_mark==l ) // this FID just violated the metering now - link it to 

45 viol_reg 

46 if(MARK_EMFTY==l) 
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1 {n_MARK_RP, n_MARK_WP. n_MARK_EMPTY} <= FID. FID. 0 

2 wr_next_FID <=0 

3 else 

4 (n_MARK_RP. n_MARK_WP. n_MARK_EMPTY} <= MARK_RP. FID. 0 

5 wr_next_FID <=I 

6 else // either the FID did not violate metering or it is already marked 

7 {n_MARK_RP. n_MARK_WP, n_MARK_EMPTY} <= MARKjRP. 

8 MARK_WP, MARK_EMPTY 

9 wr_next_FID <= 0 

10 9. if[VALID==l) 

1 1 WR FIDl_m€m[FID] <= {n_cell_cnt, njnark. n_bg_cnt. bg_cnt_value} 

12 10. if(VALID==I & wr_next_FID==l) 

13 WR FID2_mem[MARK_WP]{next_FID} <= FID 
14 

15 Background Algorithm 

16 1. Read FID_mem[ FID] {cell_cnt, mark, bg_cnt,bg_cnt_value} 

17 2. if[mark==l ) //freeze since the FID is marked 

18 a. n_bg_cnt <- bgjcnt 

19 b. n_cell_cnt <= cell_cnt 

20 3. else if[bg_cnt==0) // start a new counting cycle 

21 a. n_bg_cnt <- bg_cnt_value 

22 b. n_cell_cnt <= 0 

23 4. else 

24 a. n_bg_cnt <= bgjcnt - 1 

25 b. n_cell_cnt <= cell_cnt 

26 5. Write FID_mem[FIDJ <= {cell_cnt. mark, n_bg_cnt. bg_cnt_value} 
27 

28 Software read process 

29 1. RD FIDl_mem[MARK_RP] {cell_cnt. mark. bg_cnt, bg_cnt_valuej 

30 2. RD FID2_mem[MARK_RP] {next_FID} 

31 3. n_cell_cnt = 0 

32 4. n_mark = 0 

33 5. n_bg_cnt = bg_cnt_value 

34 (5. if(MARK_EMPTY==0) // some marked FIDs are avail 

35 if(MARK_RP= =MARK_WP)// last FID of marked FID 's link list 

36 n_MARK_RP < = MARK_RP 

37 n_MARK_EMPTY <= i 

38 e/se //more than one marked FID's avail 

39 n_MARK_RP <= next_FID 

40 n_MARK_EMPTY <= 0 

41 e/5e //no marked FID's avail 

42 n_MARK_RP <= MARK_RP 

43 n_MARK_EMPTY <= 1 

44 7. ifiVAUD==l & MARK_EMPTY==0) 

45 W/? FIDl_mem[MARK_RF] <= fn_cell_cnt. njnark. njbgjcnt, 

46 bg_cnt_value} 



147 



AZA-OOl/2001-POO 




PATENT 



1 

2 Calculation of threshold for Meter mode 

3 The meter block can measure the number of cells for different intervals of times 

4 depending n the value of the BG_CNT[3:0] as shown in Figure 422, 

5 For each measured output rate there is a maximum amount of cells that can be 

6 accumulated in 40mSec amount of time (threshold for 40mSec). The equation is: 

7 Threshold [#Cells/40mSec] ^ Rout[bits/Sec]/(25*512) = Rout/12800 

8 For example, a rate of 64,000bps means a threshold of 5 cells/40mSec. Since this number 

9 is too small, it doesn't enable much of granularity in the measurement, an increase on 

10 BG_CNT can help since the measurement will be over longer period of time. If BG__CNT 

11 is 15, then the threshold should increase to 5*16 = 80 cells. 

12 The user has the tradeoff between the time to measure and the granularity of the 

13 measurement. 
14 

15 Memory description 

16 Figure 423 shows what memories and bits are needed per mode of operation. 
17 

1 8 Rate ID/Threshold Internal Memory 

19 Shaper Rate ID for Shaper mode 

20 Length: 1024, Address: 10 bit rateK) from database block, #bits: 86 

21 See Figure 424. 

22 Meter Threshold for Meter mode 

23 Length: 1024, Address: 10 bit ThrsID from database block, #bits: 27 

24 See Figure 425. 
25 

26 Shaper Slot Memory 

27 Length: 256k, Address: 18 bit slot location, #bits: 41 

28 See Figure 426. 
29 

30 Shaper/Meter FlowID Memory 

3 1 The FID memory is divided to 2 memory blocks with different address to each. 

32 Word write-enable signals are assigned to these memories for the shaper mode. 

33 Meter algorithm uses 36 bits in FIDl memory and no bits in FID2 memory. 

34 Shaper algorithm uses 65 bits in FIDl memory and 25 bits in FID2 memory. 

35 When using single-leaky-bucket only, the unused memory bits should be tied down to 

36 zero. 



37 

38 FID MEMl for Shaper 

39 Length: IM, Address: 20 bit FlowID, #bits 72 
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1 See Figure 427. (If more bits are needed because of future needs, the credit field 

2 can be reduced) 

3 

4 FID MEMl for Meter 

5 Length: IM, Address: 20 bit FlowID, #bits: 36 

6 See Figure 428. There is no need to word write-enable signals in case of Meter 

7 mode. 
8 

9 FID MEM2 for Shaper 

10 Length: IM, Address: 20 bit FlowID, #bits: 24 

11 See Figure 429. 

12 

13 FID MEM2 for Meter 

14 Length: IM, Address: 20 bit HowID, #bits: 20 

15 See Figure 430. 

16 CPU Interface 
17 

18 Registers Description 

19 See Figure 431. 
20 

2 1 CPU Commands for Shaper mode 

22 

23 Read FIDl memory 

24 See Figure 432. 

25 P - peak_sustain 

26 Registers R2, R3 are used in case of dual-leaky-bucket only. 
27 

28 Write FIDl memory when using shaper 

29 See Figure 433. 

30 P - peak_sustain 

31 Registers R2, R3 are used in case of dual-leaky-bucket only. 
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1 

2 Read FID2 Memory 

3 See Figure 434. 

4 SBT[0] - start (bit 20 in the register) 

5 SBT[2:1] - bg_stt[l:0] (bits [22:21] in the register) 

6 SBT[3] - time_msb (bit [23] in the register) 

7 

8 Write FID2 Memory 



9 See Figure 435. 

10 SBT[0] - start (bit 20 in the register) 

11 SBT[2:1] - bg_stt[l:0] (bits [22:21] in the register) 

12 SBT[3] - time_nisb (bit [23] in the register) 

13 

14 Read SHP_SLOT memory 

15 See Figure 436. 

16 E - empty indication (bit [20] in the register) 

17 

18 Write SHP_SLOT memory 

19 See Figure 437. 

20 E - empty indication (bit [20] in the register) 

21 

22 Read RATEJD memory 

23 See Figure 438. 

24 Ks[17:0] - Ks Mantissa, Ks[22: 18] - Ks Exponent (same for Kp) 
25 

26 Write RATEJD memory 

27 See Figure 439. 

28 Ks[17:0] - Ks Mantissa, Ks[22: 18] - Ks Exponent (same for Kp) 
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1 

2 Setup connection command 

3 See Figure 440. This command resets all fields in FIDl and FID2 memories. 

4 Except the back_count field in FID2 memory, which is driven from the command itself. 

5 Also bg_stt[ 1 :0] and empty bits should be set to 1 during this command. 
6 

7 CPU Commands for Meter mode 

8 

9 Read FIDl memory 

10 See Figure 441. 

11 M - Mark bit (bit [27] in the register) 

12 Cur[3:0] - bg_cnt 

13 Cnt[3:0] - bg_cnt_value 

14 

15 Write FIDl memory 



16 See Figure 442. 

17 

1 8 Read FID2 Memory 

19 See Figure 443. 
20 

2 1 Write FID2 Memory 

22 See Figure 444. 
23 

24 Read Threshold (internal) Memory 

25 See Figure 445. 
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1 

2 Write Threshold (internal) Memory 



3 See Figure 446. 

4 

5 ReadModWrite Meter 

6 See Figure 447. 

7 M - Mark bit (bit [27] in the register) 

8 Cur[3:0] - bg_cnt 

9 Cnt[3:0] - bg_cnt_value 

10 This command triggers a write back to the same location clearing all values to zero 

1 1 (except the bg_cnt_value). That's is why this command has a different opcode then a 

12 simple read. 

13 

14 Setup connection command 

15 See Figure 448. 

16 This command writes the following data to FIDl memory: 

17 celLcnt <= 0, mark <= 0, bg_cnt <= cnt[3:0], bg_cnt_value <= cnt[3:0] 

18 

19 Reserved Op-codes 

20 The op-codes 1000, 1001, 1010, 1011, 1100, 1101, 1110, 1111 are reserved. 
21 

22 Timing Diagrams for Shaper 

23 The Shaper is able to process one input phase, one background phase and one 

24 output phase every 8 clock cycle slots. 
25 

26 Timing Diagrams for Meter 

27 The Meter needs to perform one input phase, one background process and one 

28 CPU RdAVR accesses during 8 cycles budget. Figure 449 is a timing diagram that shows 

29 the accesses to the memories during these 8 cycles. 

30 The CPU RdAVr operation and the Meter software phase cannot exist together at 

3 1 the same 8 cycles. If a ReadModWrite command is issued then no other CPU interface is 

32 possible during these 8 cycles. 

33 For regular ReadAVrite from the internal/External memories, all accesses occur at 

34 slot 7. There are no data dependencies between any of the accesses. The ReadModWrite 

35 command clears all fields therefore the background process can be overwritten by it. 



36 
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1 MX 2 CPU Interface 

2 This block is responsible for three different functions: 

3 • Connects between the blocks and the CPU via the CPU interface. It decodes the 

4 address to Figure which block to access and generate the chip select to each 

5 block. 

6 • Generates global synchronization signals to the rest of the blocks. 

7 • Holds general control registers like version register, mode register etc. 

8 Features 

9 • Address bus of 10 bits per chip. 

10 Bits 0-5 are used as an address inside the block (up to 64 registers per block) 

1 1 Bits 9-6 are used to select a block (up to 16 blocks per chip) 

12 • Data bus of 32 bits per chip. 

13 Externally this bus is b-directional 

14 Internally this bus is split to 2 busses: DATA_IN and DATA_OUT. 

15 • Chip select signal is the qualifier for data, address and read/write busses. 

16 • The CPU interface can support multiplexed CPU interface or non-multiplexed CPU 

17 interface. When using multiplexed interface, the address and data are folded to the 

18 DATA[31:0]bus. 

19 • Support little endian and big endian. 

20 • CPU accesses all registers in all blocks via direct read/write commands. 

21 • CPU accesses all internal/external memories in all blocks via indirect accesses using 

22 the general-purpose registers dedicated for these accesses. 

23 • Blocks addresses are shown in Figure 450. 

24 • Generates a global sync signal once every 520 cycles of the 200MHz clock (65 ports, 

25 8 cycles per port). 

26 • Multiplexes all test busses from the blocks to one output test bus. 
27 

28 Functional Description 

29 The CPU IF block is used to interface with the CPU. The CPU IF block decodes 

30 the incoming address bus to generate a unique chip-select signal to each block. The CPU 

31 IF block also holds general registers that contain general parameters and modes for the 

32 entire chip. Basically any logic that is not related directly to a specific block is in this 

33 block. 

34 The CPU IF block can operate in two modes depending on an input pins to select 

35 between them: 

36 • Regular CPU access: separate busses for data and for address. In this case address 

37 bus is qualified be the chip-select signal. 

38 • Multiplexed CPU access: one bus is multiplexed for address and for data. In this 

39 case address is qualified by the ALE signal. 

40 The CPU interface is based on Intel's microprocessor interface (when multiplexing 

41 address/data bus). The following timing diagrams defines the characteristics of the CPU 

42 interface for Read and for Write accesses: 
43 

44 Read access characteristics 

45 See Figure 451. 
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1 • RDWR_L signal has the same timings as the ADDR bus. 

2 • A valid read cycle is defined when RDWR_L is latched as high on the negative edge 

3 of the CS_L signal. 

4 • In non-multiplexed address/data bus architecture, ALE should be held high so 

5 parameters tsair, tHair, Wl, tsir and iuh are not applicable. 

6 • In our implementation the address is latched on the negative edge of the CS_L 

7 therefore tHar is not applicable. 

8 See Figure 452. 
9 

10 Write access characteristics 

11 See Figure 453. 

12 • RDWR_L signal has the same timings as the ADDR bus. 

13 • A valid write cycle is defined when RDWR_L is latched as low on the negative edge 

14 of the CS_L signal. 

15 •In non-multiplexed address/data bus architecture, ALE should be held high so 

16 parameters tsaiw, tnaiw, tvt, tsiw and tHiw are not applicable. 

17 • In our implementation the address is latched on the negative edge of the CS_L 

18 therefore tHaw is not applicable. 

19 See Figure 454. 
20 

21 Timing diagram of signals to blocks 

22 The following diagram describes the timings of the CPU interface to each block. 

23 Note that chip-select signal is now asserted for only one cycle of the 200MHz clock. For 

24 a write or read cycle, all parameters are valid and stable before chip-select is asserted. 

25 When a read cycle occurs data is driven from the block after a maximum of 3 clock 

26 cycles of 200MHz clock. 

27 See Figure 455. 
28 

29 Configuration modes 

30 The CPU interface has 2 configuration bits to control the CPU interface as shown 

31 in Figure 456. 
32 

33 GSYNC - Synchronization signal 

34 GSYNC signal is a one-cycle pulse in the 200MHz clock, which is asserted once 

35 every 520 cycles. The SYNC signal is used for synchronization between the blocks and 

36 between the chips as well as to generate the slot-count and the CPU-port indication. 

37 See Figure 457. 

38 Slot count and CPU port indication 

39 The signals slot_count and CPU_port are generated inside each block base on the 

40 GSYNC signal. Each block first needs to sample GSYNC and use the sampled version to 

41 generate the signals. See Figure 458. 
42 
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1 Test Multiplexing 

2 The CPU interface block receives test busses from each block and multiplex them 

3 to one output test bus. The width of all test busses is 32 bits. Each block has a control 

4 register to select the information to drive to the pins. The CPU interface block has a 

5 control register to select between the blocks. 

6 See Figure 459. 
7 

8 Interface 

9 The CPU IF block generates chip-select signal per block, the rdwrj, addr and 



10 data busses are propagated to the blocks directly. The blocks perform the read or write 

1 1 command based on the rdwrJ signal only when the chip-select signal per the block is 

12 asserted. The chip-select signal per each block is synchronous to the 200MHz clock 

13 domain already. Each block first samples the incoming signals and then uses them. This 

14 is being done to solve any timing issues that may occur. If during placement and routing 

15 the signals from the CPU IF to the blocks will violate the 5nSec budget, another set of 

16 FFs can be placed in the middle of the path. 

17 For a read cycle the CPU IF block selects the data busses coming in from the blocks 

18 based on the chip-select per block (the address). 



19 See Figure 460; software reset and testing signals are not included in the diagram. 

20 The address, data and rd/wr signals takes about 4 cycles of 200MHz to propagate from 

21 the chip's pins to the block (6 cycles are used for delay calculation). 2 cycles are needed 

22 to perform read or write accesses in the block. 

23 Therefore a total of 8 cycles are needed to perform a read or a write access. 
24 

25 Block Descriptions 

26 

27 CPU Interface 

28 The negative edge of the CS_L signal qualifies the ADDR[9:0] and the RDWR„L 

29 signals. The positive edge of the CS_L signal qualifies the DATA bus for read and for 

30 write accesses. The CS_L is basically an enable signal to latches. Figure 461 shows the 

31 basic latching circuit and the synchronization to 200MHz clock domain: 
32 

33 Registers 

34 Figure 462 describes the registers implemented in the CPU IF block. These 

35 registers act as control to the rest of the chip. The registers are implemented in the 

36 200MHz clock domain even if there is no need to do so since their value is fixed and 

37 stable during normal operation. 
38 

39 Appendix A: Tags for Lookup Engine 

40 The following types of traffic are supported on the Maximus chip Lookup Engine 

41 Block: 

42 1. ATM 

43 2. MPLS 

44 a. ATM 

45 b. PPP 

46 c. Ethernet 
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1 d. Frame Relay 

2 3. Ethernet 

3 4. IP Frame Relay 

4 (All bit numbering assumes Big Endian ordering) 

5 See Figure 268. 

6 For the first version of the chip, the Flow Descriptor(FD) in Lookup Engine block 



7 will support Data Types 1 and 2(a) thru 2(d). Pure Ethernet, IP and Frame Relay will be 

8 supported in the next version of the Maximus chip. Header format references for these 

9 traffic types are given below: 
10 

1 1 HEADER FORMATS 

12 1. ATM - Tag in the Link Layer (carried in 28 bit fields - 12 bit VPI/16 bit VCI) 

13 (Computer Networks, 3"" Edition, Andrew Tannenbaum, pg. 45 1) 



14 See Figure 269. Each field is 8 bits wide with 12 bit VPI and 16 bit VCI field, for 

15 a total of 28 bits lookup tag. This 28 bit lookup tag is located in the 1'' 64-bit word, bits 1 

16 thru 28. 
17 

1 8 2(a) MPLS - ATM, 2(b) MPLS-PPP, 2(c) MPLS-Ethernet arid 2(d) MPLS-FR 

19 (Tag is always between the Link and the Network Layer) 

20 (MPLS, Technology & Applications, Davie & Rekhter, pg. 132) 

21 The MPLS Header is inserted between the Link Layer and the Network Layer of 

22 each of the above four protocols. The detailed Link Layer description of each of the 

23 protocols is given below in other sections of this document. 

24 The MPLS field is the same for the four different Link Layer protocols. The exact 



25 location of this 20-bit MPLS label in the Frame varies with the Link Layer used, and is 

26 summarized in the table above. With 20 bits, the MPLS label allows having 2^° possible 

27 tags. See Figure 270. 
28 

29 3. Ethernet -Tag in the Link Layer 

30 (Computer Networks, 3"* Edition, Andrew Tannenbaum, pg. 281) 

31 Ethernet Destination Address(DA) field is used in the Lookup Engine Block. 

32 These are the 48 bits after the first 64 bits as shown below. See Figure 27 1. 
33 

34 4. IP - Tag in Network Layer 

35 (Computer Networks, 3"* Edition, Andrew Tannenbaum, pg. 413) 

36 The IP Packet Destination Address is used for Lookup Engine. Starting from the 

37 the 3"* 64 bit word, the Destination Address is 32 bits field and is used in the Lookup 

38 Engine. This assumes Big Endian Transmission. See Figure 272. 
39 

40 5. Frame Relay (FR) - Tag in Link Layer (carried in 10 bit DLCI field) 

41 (Frame Relay Technology & Practice, Buckwalter, pg. 39) See Figure 273. 
42 

43 6. Point to Point Protocol (PPP) 

44 (Computer Networks, 3"* Edition, Andrew Tannenbaum, pg. 232) See Figure 274. 
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1 

2 Appendix B: Flow Control Mechanism of CSIX-Ll 

3 CSK provides multiple levels of flow control and supports flow control in both 

4 directions. At ingress, flow control is fabric-to-TM; at egress, flow control is TM-to- 

5 fabric. 
6 

7 Link-level Flow Control 

8 The link level flow control is symmetric across the CSEX interface and provides 

9 independent control for data and control queues. There are 2 link level queues. One queue 

10 is for control traffic and the other queue is for data traffic. For each queue there is a 

1 1 ready bit in every header indicating the congestion status for the receive queue of the 

12 respective traffic type. See Figure 275. 
13 

14 Fabric Flow Control 

15 The fabric flow control provides a finer level of flow control than link level flow 

16 control by specifying the specific TM and Class that are oversubscribed. Flow control on 

17 a TM port (line-end or VC) basis is the responsibility of cooperating TMs. Flow control 

18 frames can go in both the ingress and egress directions. See Figure 276 and 277. 
19 

20 Appendix C — MX2 Software Interface: A User Guide to Software 

21 Interface 

22 This document describes the software interface to the MX2 chip via the CPU 

23 interface. This document should include all atomic CPU commands as well as the high 

24 levl commands. 

25 There are two types of conmiands: 

26 1 . Low-level commands, when issued, the driver generates a single command to access 

27 registers or internal/external memories in one specific block. 

28 For example read scheduler's external memory 

29 2. High-level commands, when issued, the driver generates several commands to access 

30 registers or internal/external memories in several blocks. 

31 For example setup connection command to initialize FID database in several 

32 blocks. 
33 

34 Features 

35 1. the interface supports up to 16 different blocks in the chip, each block has up to 64 

36 registers. 

37 2. CPU accesses all registers in all blocks via direct read/write commands. 

38 3. CPU accesses all internal/external memories in all blocks via indirect accesses using 

39 the general-purpose registers dedicated for these accesses. 

40 4. CPU interface can support multiplexed address/data bus or non-multiplexed. 

41 5. CPU interface can support big endian or little endian modes. 
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1 

2 Interface 

3 

4 Physical Interface 

5 The CPU interface to the MX2 chip is via standard CPU interface that contains 

6 signals as shown in Figure 463. The software, using this interface, can directly access the 

7 registers inside each block and can indirectly access internal and external memories per 

8 each block. 
9 

10 Address spaces per block/chip 

1 1 The 4 MSBs of the address bus points to the specific block, while the 6 LSBs of 

12 the address points to internal registers inside the block. Each block has address space of 

13 64 entries only. The internal registers of each block are directly mapped to these 

14 addresses and external/internal memories are indirectly mapped. The MX2 chip can 

15 support up to 16 blocks with 64 registers each. Figure 464 is the allocation of address 

16 spaces per block. 
17 

18 Direct Access 

19 A direct access is used to access registers only. The only two commands used in a 

20 direct access are read and write. Each register has it's own unique address inside the chip. 
21 

22 Indirect Access 

23 An indirect access is used to access internal or external memories. Each block has 

24 a set of registers dedicated for the indirect access. The COM register contains an Op- 

25 code and address of the memory to access. Registers RO through Rn are used to contain 

26 the data. 

27 Indirect Write Command 

28 A write conmiand starts with writing the data to registers RO to Rn and then 

29 writing the opcode and the address to the COM register. The opcode defines which 

30 memory to access and if it is a read or write command. 

31 The hardware then, takes the data from registers RO through Rn and write them to a 

32 specific address defined in the COM register in the specific memory defined by the 

33 opcode. 

34 Once the command is complete, the hardware resets the opcode bits in the COM 

35 registers. It is guarantee that aread/write command to internal/external memory will be 

36 complete after 12 cycles of the operating clock (@200MHz it means 60nSec). 

37 Indirect Read Command 

38 A read command starts with writing the opcode and the address to the COM 

39 register. The opcode defines which memory to access and if it is a read or write 

40 command. 
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1 The hardware then, uses the address to perform a read access the memory. Once 

2 the data arrives the hardware assign the data to registers RO through Rn and then clears 

3 the opcode bits in the COM register. 

4 The software then, generates direct reads to registers RO through Rn to get the 

5 data. It is guarantee that aread/write command to internal/external memory will be 

6 complete after 12 cycles of the operating clock (@200MHz it means 60nSec). 
7 

8 CPUIF Block 

9 CPUIF block doesn't have any internal or external memories, therefore, only 
10 direct commands to registers are possible in this block. 

11 

12 Registers 

13 See Figure 465. 
14 

15 Commands 

16 N/A 
17 

18 PFQ Block 

19 

20 Registers 

21 See Figure 466. 
22 

23 Commands 

24 Read FID Enqueue Memory (72 bits) - Up to 2M locations 

25 See Figure 467. 

26 Write FID Enqueue Memory (72 bits) - Up to 2M locations 

27 See Figure 468. 

28 Read FID Dequeue Memory (36 bits)- Up to IM locations 

29 See Figure 469. 

30 Write FID Dequeue Memory (36 bits) - Up to IM locations 

31 See Figure 470. 

32 Read BID Memory (36 bits) - Up to 16M locations 

33 See Figure 471. 
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1 Write BID Memory (36 bits) - Up to 16M locations 

2 See Figure 472. 

3 Read Stat Memory (72 bits) - Up to 2M locations 

4 See Figure 473. 

5 Write Stat Memory (72 bits) - Up to 2M locations 

6 See Figure 474. 

7 Setup connection command for FID 

8 See Figure 475. This command sets up a connection by accessing external 

9 memories and initializes the corresponding locations to the proper parameters. It is a 

10 write command. Clear PFQ entries for ENQ, DEQ and STAT! 

1 1 Tear-Down connection command for FID 

12 See Figure 476. This command tears down a connection by accessing external 

13 memories. It is a write command. The data is all "0" PFQ returns all remaining BIDS on 

14 the queue of the down connection to the free link list during this command. 

15 INIT FID MEMORIES 

16 The memories are the enqueue, dequeue, and statistics memories. See Figure 477. 

17 INIT BID Memory 

18 This will build the free buffer list. See Figure 478. 

19 Reserved Op-codes 

20 The op-codes 1101,11 10, and 1 1 1 1 are reserved. The following commands are 

21 combinations of the above simple commands. A user command like this should be broken 

22 to several Read or Write commands in the driver. 

23 Initialized External memories 

24 • Write "0" in all memory locations. 

25 • Write "NULL" in the enqueue memory. 

26 • Write "NULL" in the dequeue memory. 

27 • Build the free buffer list by linking each buffer to the next one. 
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Initialized Internal memories 



2 • Write "NULL" in the Multicast Root memory. 

3 • Write "NULL" in the Multicast Leaf memory. 

4 • Write "NULL" in the Tunneling Root memory. 

5 • Write "NULL" in the Tunneling Leaf memory. 
6 

7 DataBase Block 

8 

9 Registers 

10 See Figure 479. 
11 

12 Commands 



13 Read external FID memory 



14 



See Figure 480. 



15 A[0] - C_P (bit [6] in the register) 

16 A[l] - CPU_PORT (bit [7] in the register) 

17 Pri[2:0] - Priority for shaped traffic 

18 S - Shape (bit [23] in the register) 

19 C[0] - dpi (bit [10] in the register) 

20 C[l] - Empty (bit [1 1] in the register) 

21 Write external FID memory 

22 See Figure 481. 

23 A[0] - C_P (bit [6] in the register) 

24 A[ 1] - CPU_PORT (bit [6] in the register) 

25 Pri[2:0] - Priority for shaped traffic 

26 S - Shape (bit [23] in the register) 

27 C[0] - dpi (bit [10] in the register) 

28 C [ 1 ] - Empty (bit [ 1 1 ] in the register) 

29 Setup FID connection 

30 See Figure 482. 

3 1 A[0] - C_P (bit [6] in the register) 

32 A[ 1 ] - CPU_PORT (bit [6] in the register) 

33 Pri[2:0] - Priority for shaped traffic 

34 S - Shape (bit [23] in the register) 

35 During the setup connection command, the user defines the above fields only. 

36 The fields RP, WP, NCELL, CELL_CNT and CLP should be reset to 0 by the hardware 

37 The field EMPTY should be set to 1 by the hardware. 
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1 Read external PKT memory 

2 See Figure 483. 

3 L[0] - CLP 

4 L[l] - Reserved 

5 Write external PKT memory 

6 See Figure 484. 

7 L[0]-CLP 

8 L[l] - Reserved 

9 Read port calendar memory 

10 See Figure 485. 

11 Data[5:0] - portid 

12 Data[6] - valid 

13 Data[7] - jump 

14 Write port calendar memory 

15 See Figure 486. 

16 Data[5:0] - portid 

17 Data[6] - valid 

18 Data[7] - jump 

19 Read SHP Strict memory 

20 See Figure 487. 

21 P[2:0] - previous QOS pointer 

22 P[3] - previous QOS pointer valid 

23 Write SHP Strict memory 

24 See Figure 488. 

25 Read SHP out memory 

26 See Figure 489. 

27 Write SHP out memory 

28 See Figure 490. 



162 



AZA-OOl/2001-POO 




PATENT 



1 Init SHP_out_port memory 



2 See Figure 491. This command generates write accesses to all entries in the 

3 Shp_out memory (64 entries) with a specific data of: p_rp = 0, p_wp - 0, p_e = 1 

4 Init Shp_strict memory 

5 See Figure 492. This conmiand generates write accesses to all entries in the 

6 Shp_strict memory (64 entries) with a specific data of: empty = 8'bl 1 1 1_1 1 1 1 , 

7 prev_qos_v = 0, prev_qos = 0 

8 Reserved Opcodes 

9 The opcodes: 1 1 10, 1 1 1 1 are reserved. 
10 

11 Shaper/Meter Block 

12 

13 Registers 

14 See Figure 493. 
15 

1 6 Commands for Shaper mode 

17 Read FIDl memory 

18 See Figure 494. 

19 P - peak_sustain 

20 Registers R2, R3 are used in case of dual-leaky-bucket only. 

21 Write FIDl memory when using shaper 

22 See Figure 495. 

23 P - peak_sustain 

24 Registers R2, R3 are used in case of dual-leaky-bucket only. 

25 Read FID2 Memory 

26 See Figure 496. 

27 SBT[0] - start (bit 20 in the register) 

28 SBT[2:1] - bg_stt[l:0] (bits [22:21] in the register) 

29 SBT[3] - time_msb (bit [23] in the register) 

30 Write FID2 Memory 

31 See Figure 497. 

32 SBT[0] - start (bit 20 in the register) 

33 SBT[2:1] - bg^stt[l:0] (bits [22:21] in the register) 
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1 SBT[3] - time_msb (bit [23] in the register) 

2 Read SHP_SLOT memory 

3 See Figure 498. 

4 E - empty indication (bit [20] in the register) 

5 Write SHP_SLOT memory 

6 See Figure 499. 

7 E - empty indication (bit [20] in the register) 

8 Read RATEJD memory 

9 See Figure 500. 

10 Ks[17:0] - Ks Mantissa, Ks[22: 18] - Ks Exponent (same for Kp) 

1 1 Write RATEJD memory 

12 See Figure 501. 

13 Ks[17:0] - Ks Mantissa, Ks[22:18] - Ks Exponent (same for Kp) 

14 Setup connection command 

15 See Figure 502. 

16 This command resets all fields in FIDl and FTD2 memories. Except the back_count field 

17 in FID2 memory, which is driven from the command itself. 

18 Also bg_stt[l:0] and empty bits should be set to 1 during this command. 
19 

20 Commands for Meter mode 

21 Read FIDl memory 

22 See Figure 503. 

23 M - Mark bit (bit [27] in the register) 

24 Cur[3:0]-bg_cnt 

25 Cnt[3:0] - bg_cnt_value 

26 Write FIDl memory 

27 See Figure 504. 

28 Read FID2 Memory 

29 See Figure 505. 



164 



AZA-001/2001-P00j^ ^ PATENT 

1 Write FID2 Memory 

2 See Figure 506. 

3 Read Threshold (internal) Memory 

4 See Figure 507. 

5 Write Threshold (internal) Memory 

6 See Figure 508. 

7 ReadModWrite Meter 

8 See Figure 509. 

9 M - Mark bit (bit [27] in the register) 

10 Cur[3:0] - bg_cnt 

1 1 Cnt[3:0] - bg_cnt_value 

12 This command triggers a write back to the same location clearing all values to 

13 zero (except the bg_cnt_value). That's is why this command has a different opcode then a 

14 simple read. 

15 Setup connection command 

16 See Figure 510. This command writes the following data to FIDl memory: 

17 celLcnt <= 0, mark <= 0, bg_cnt <= cnt[3:0], bg_cnt_value <= cnt[3:0] 

18 Reserved Op-codes 

19 The op-codes 1000, 1001, 1010, 1011, 1100, 1101, 1110, 1111 are reserved. 
20 

21 Scheduler Block 

22 

23 Registers 

24 See Figure 511. 

25 Commands 

26 Read FID NEXT Memory 

27 See Figure 512. 

28 Write FID NEXT Memory 

29 See Figure 513. 
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1 Read QOS Parameters 

2 See Figure 514. 

3 Write QOS Parameters 

4 See Figure 515. 

5 Read QOS Descriptor 

6 See Figure 516. 

7 Write QOS Descriptor 

8 See Figure 517. 

9 Read PORT Parameters 

10 See Figure 518. 

1 1 Write PORT Parameters 

12 See Figure 519. 

1 3 Read PORT Descriptor 

14 See Figure 520. 

1 5 PQ[0] = PREV_QOS_VALID 

16 PQ[3:1] = PREV_QOS[2:0] 

17 PQ[6:4] = ACTIVE_PTR[2:0] 
18 

19 Write PORT Descriptor 

20 See Figure 521. 

21 Read PORT Next 

22 See Figure 522. 

23 Write PORT Next 

24 See Figure 523. 
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1 Init QOS 

2 See Figure 524. This command write the parameters to the QOS Parameter 

3 Memory and also initiates a write to the QOS Descriptor Memory with the following 

4 values: 

5 F_EMPTY = 1 

6 Rest of the fields should be reset to zero 

7 Init port 

8 See Figure 525. This conmiand write the parameters to the Port Parameter 

9 Memory and also initiates a write to the Port Descriptor Memory with the following 

10 values: 

1 1 QW_EMPTY[7:0] and QA_EMPTY[7:0] = 8'bl 1 1 1_1 1 1 1 

12 Rest of the fields are reset to zero. 

13 Reserved Op-codesReserved Op-codesReserved Op-codes 

14 The op-codes 11 11 are reserved. 
15 

16 Driver^s high level commands 

17 The following section describes higher level commands then just read/write to 

18 internal/external memories. The following commands are basically combinations of the 

19 above commands and are created to give easier user interface to the chip. 

20 These commands doesn't have special opcode in the chip itself, they are implemented in 

21 the driver only. 
22 

23 Setup Connection Command 

24 This command sets up a connection by accessing external memories in several 

25 blocks and initializes the proper parameters per FlowED. 

26 The following table describes the parameters per FlowID that should be set during 

27 the setup connection command, for more specific information about the fields, please 

28 refer to the blocks SPECs in Figure 526. 

29 Once the driver receives a setup connection command, the driver generates setup 

30 connection command to the DBS, SHP and PFQ blocks. The parameters to the setup 

31 connection commands to the blocks are driven from the user along with the global setup 

32 connection conmiand to the driver. 

33 ' 

34 Tear-down Connection Command 

35 A tear-down-connection command is used only in the PFQ block. This command 

36 is needed to invalidate some fields in the FE) descriptor memories as well as the statistics 

37 fields. The Database, Shaper and Scheduler blocks don't need to invalidate any of the 

38 FED fields. Therefore, there is no special driver command for this conmiand. The user can 

39 initiate a simple write command to the PFQ to implement this conmiand. 
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1 

2 Power-up initialization 

3 When reset is applied to the chip (or power-up) the internal control registers wakes up 

4 in their default value, all traffic is disabled and internal/external memories are not 

5 defined. The following is the initialization sequence the user should perform before 

6 allowing any traffic to the MX2 chip: 
7 



8 


1. 


PFQ- 


issue "Init FID Memory" command 


9 


2. 


PFQ- 


issue "Init BID Memory" command 


10 


3. 


PFQ- 


initialize multicast internal memories (if needed) 


11 


4. 


PFQ- 


initialize tunneling internal memory (if needed) 


12 


5. 


PFQ- 


specify bits in control register if different then default (address 36) 


13 


6. 


PFQ- 


initialize RED internal memory only if using RED 


14 


7. 


PFQ- 


initialize class threshold registers (addresses 45 to 52) only when usin 


15 




CLASS 


16 


8. 


DBS- 


initialize Port Calendar memory 


17 


9. 


DBS- 


• issue "Init Shp_strict Memory" command 


18 


10. 


DBS- 


• issue "Init Shp_out_port Memory" command 


19 


11. 


CPUIF - define Shaper/Meter mode (register 1 bit [8]) 


20 


12. 


SHP- 


initialize RatelD/Threshold memory 


21 


13. 


SHP- 


issue "Init Slot Memory" command 


22 


14. 


SCH- 


■ initialize Port Descriptor Memory 


23 


15. 


SCH- 


- initialize QOS descriptor Memory 


24 


16. 


<setup 


connections if needed> 


25 


17. 


PFQ- 


enable input traffic for specific ports by writing to registers 43 and 44 


26 


18. 


SCH- 


- enable output traffic from scheduler (register 32 bit [0]) 


27 


19. 


SHP- 


enable output traffic from shaper (register 32 bit [1]) 



28 

29 Once these steps are complete, the user can initiate setup connection conmiands. 
30 

31 The user doesn't have to go through all of the above steps, for example, if tunneling is 

32 not used then there is no need to initialize the internal memory for tunneling in the PFQ 

33 block.the MX2 Chip 
34 

35 Appendix D — ^MX2 External Memory 

36 See Figure 527. 
37 

38 While this invention has been described in terms of several preferred embodiments, 

39 there are alterations, permutations, and equivalents, which fall within the scope of this 

40 invention. It should also be noted that there are many alternative ways of implementing 

41 the methods and apparatuses of the present invention. For example, the present invention 

42 may be practiced by integrating line card functionality with switch fabric functionality 

43 such that a router does not have individual line cards. Alternatively, router in accordance 

44 with the invention may include many lines cards deploying the Maximus device, some of 

45 the line cards functioning as ingress line cards, others of the line cards functioning as 

46 egress line cards. It is therefore intended that the following appended claims be 



168 



AZA-001/2001-P00^ ^ PATENT 



1 interpreted as including all such alteration, permutations, and equivalents as fall within 

2 the true spirit and scope of the present invention. 
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